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TITLE ---- 
TRIACYLGLYCEROL LIPASES 
This application claims the benefit of U.S. Provisional Application No. 60/083,688. 
filed April 30, 1998. 

FIELD OF THE INVENTION 
This invention is in the field of plant molecular biology. More specifically, this 
invention pertains to nucleic acid fragments encoding triacylglycerol lipases in plants and 
seeds. 

BACKGROUND OF THE INVENTION 

True lipases attach triacylglycerols and act at an oil-water interface; they constitute a 
ubiquitous group of enzymes catalyzing a wide variety of reactions, many with industrial 
potential. Triacylglycerol lipases catalyze the transformation of triacylglycerol and water 
into diacylglycerol and a fatty acid anion. Human gastric lipase, rat lingual lipase, and 
human hepatic lysosomal lipase amino acid sequences are homologous but are unrelated to 
porcine pancreatic lipase apart from a 6 amino-acid sequence around the essential Ser-152 of 
porcine pancreatic lipase (Bodmer. M. W. (1987) Biochim Biophys Acta 909:237-244). 
These enzymes are glycosylated, contain a hydrophobic signal peptide, and belong to a gene 
family of acid lipases (Ameis, D. et al. (1994) EurJBiochem 279:905-914). Lysosomal acid 
lipase (LAL) is a hydrolase essential for the intracellular degradation of cholesteryl esters 
and triacylglycerols and participates in the mobilization of seed oil during germination. No 
plant triacylglycerol lipase cDNAs of this class are currently listed in GenBank. 

Neutral triacylglycerol lipases have been widely studied in fungi, bacteria, mammals, 
and insects. Nucleotide sequences with similarities to neutral triacylglycerol lipases in 
Arabidopsis thaliana and lpomea nil have been described but their function has not yet been 
proven. The X-ray structure of the Mucor miehei triglyceride lipase has been reported, 
revealing a Ser. . .His... Asp trypsin-like catalytic triad with an active serine buried under the 
short helical fragment of a long surface loop (Brady, L. et al. (1990) Nature 343:161 -110). 

It may be useful to isolate triacylglycerol lipase cDNAs from plants that accumulate 
large amounts of fatty acids with unusual structures. Lacking this ability could be a possible 
limitation in development of transgenic crops with novel seed oils. Triacylglycerol lipases 
may also be useful in processing of plant seed oils. Lysosomal acid lipase (LAL) may be 
used to engineer lipid and cholesteryl ester metabolism and/or lysosome function. 

SUMMARY OF THE INVENTION 

The instant invention relates to isolated nucleic acid fragments encoding 
triacylglycerol lipases. Specifically, this invention concerns an isolated nucleic acid 
fragment encoding an acid or a neutral triacylglycerol lipase. In addition, this invention 
relates to a nucleic acid fragment that is complementary to the nucleic acid fragment 
encoding an acid or a neutral triacylglycerol lipase. 
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An additional embodiment of the instant invention pertains to a polypeptide encoding^ 
all or a substantial portion of a triacylglycerol lipase selected from the group consisting of 
acid and neutral triacylglycerol lipases. 

In another embodiment, the instant invention relates to a chimeric gene encoding an 
5 acid or a neutral triacylglycerol lipase, or to a chimeric gene that comprises a nucleic acid 
fragment that is complementary to a nucleic acid fragment encoding an acid or a neutral 
triacylglycerol lipase, operably linked to suitable regulatory sequences, wherein expression 
of the chimeric gene results in production of levels of the encoded protein in a transformed 
host cell that is altered (i.e., increased or decreased) from the level produced in an 
10 untransformed host cell. 

In a further embodiment, the instant invention concerns a transformed host cell 
comprising in its genome a chimeric gene encoding an acid or a neutral triacylglycerol 
lipase, operably linked to suitable regulatory sequences. Expression of the chimeric gene 
results in production of altered levels of the encoded protein in the transformed host cell. 
15 The transformed host cell can be of eukaryotic or prokaryotic origin, and include cells 
derived from higher plants and microorganisms. The invention also includes transformed 
plants that arise from transformed host cells of higher plants, and seeds derived from such 
transformed plants. 

An additional embodiment of the instant invention concerns a method of altering the 
20 level of expression of an acid or a neutral triacylglycerol lipase in a transformed host cell 
comprising: a) transforming a host cell with a chimeric gene comprising a nucleic acid 
fragment encoding an or a neutral acid triacylglycerol lipase; and b) growing the 
transformed host cell under conditions that are suitable for expression of the chimeric gene 
wherein expression of the chimeric gene results in production of altered levels of acid or 
25 neutral triacylglycerol lipase in the transformed host cell. 

An addition embodiment of the instant invention concerns a method for obtaining a 
nucleic acid fragment encoding all or a substantial portion of an amino acid sequence 
encoding an acid or a neutral triacylglycerol lipase. 

BRIEF DESCRIPTION OF THE 
30 DRAWINGS AND SEQUENCE DESCRIPTIONS 

The invention can be more fully understood from the following detailed description 
and the accompanying drawings and Sequence Listing which form a part of this application. 

Figure 1 depicts the amino acid sequence alignment between the acid triacylglycerol 
lipase from rice clone rlr72.pk0015.b2 (SEQ ID NO: 14), soybean contig assembled from 
35 clones sdp3c.pk004.n3 and ssl.pk0022.al (SEQ ID NO: 1 8), soybean contig assembled from 
clones slslc.pk009.o2, srrlc.pk001.ml9 and sre.pk0004.d7 (SEQ ID NO:20), Canis 
familiaris (NCBI General Identifier No. 3041702, SEQ ID NO:35) and Caenorhabditis 
elegans (NCBI General Identifier No. 3 1 65581 , SEQ ID NO:36). Amino acids which are 
conserved among all sequences are indicated with an asterisk (*) while amino acids 
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conserved only among plant sequences are indicated by a plus sign (+). Dashes aire used by 
the program to maximize alignment of the sequences. 

The following sequence descriptions and Sequence Listing attached hereto comply 
with the rules governing nucleotide and/or amino acid sequence disclosures in patent 
5 applications as set forth in 37 C.F.R. §1 .82 1 -1 .825. 

SEQ ID NO:l is the nucleotide sequence comprising the entire cDNA insert in clone 
cen3n.pk0129.e9 encoding a portion of a corn acid triacylglycerol lipase. 

SEQ ID NO:2 is the deduced amino acid sequence of a portion of a corn acid 
triacylglycerol lipase derived from the nucleotide sequence of SEQ ID NO: 1 . 
10 SEQ ID NO:3 is the nucleotide sequence comprising the 3' 647 nucleotides from the 

cDNA insert in clone ncs.pk0013.hl encoding the C-terminal quarter of a Catalpa acid 
triacylglycerol lipase 

SEQ ID NO:4 is the deduced amino acid sequence of the C-terminal quarter of a 
Catalpa acid triacylglycerol lipase derived from the nucleotide sequence of SEQ ID NO:3. 
15 SEQ ID NO:5 is the nucleotide sequence comprising the 5* 705 nucleotides from the 

cDNA insert in clone ncs.pk0013.hl encoding the N-terminal third of a Catalpa acid 
triacylglycerol lipase. 

SEQ ID NO:6 is the deduced amino acid sequence of the N-terminal third of a Catalpa 
acid triacylglycerol lipase derived from the nucleotide sequence of SEQ ID NO:5. 
20 SEQ ID NO:7 is the nucleotide sequence comprising the contig assembled from a 

portion of the cDNA insert in clones p0075.cslag33r, p0126.cnlay46r and p0014.ctuty54r 
encoding a substantial portion of a corn acid triacylglycerol lipase. 

SEQ ID NO:8 is the deduced amino acid sequence of a substantial portion of a corn 
acid triacylglycerol lipase derived from the nucleotide sequence of SEQ ID NO:7. 
25 SEQ ID NO:9 is the nucleotide sequence comprising a portion of the cDNA insert in 

clone p0102xeral64r encoding a portion of a corn acid triacylglycerol lipase. 

SEQ ID NO: 10 is the deduced amino acid sequence of a portion of a com acid 
triacylglycerol lipase derived from the nucleotide sequence of SEQ ID NO:9. 

SEQ ID NO:l 1 is the nucleotide sequence comprising a portion of the cDNA insert in 
30 clone pOl 26.cnlcm37r encoding a portion of a corn acid triacylglycerol lipase. 

SEQ ID NO: 12 is the deduced amino acid sequence of a portion of a com acid 
triacylglycerol lipase derived from the nucleotide sequence of SEQ ID NO:l 1. 

SEQ ID NO: 13 is the nucleotide sequence comprising the entire cDNA insert in clone 
rlr72.pk0015.b2 encoding an entire rice acid triacylglycerol lipase. 
35 SEQ ID NO: 14 is the deduced amino acid sequence of an entire rice acid 

triacylglycerol lipase derived from the nucleotide sequence of SEQ ID NO: 13. 

SEQ ID NO: 15 is the nucleotide sequence comprising a portion of the cDNA insert in 
clone rslln.pk012.h7 encoding a portion of a rice acid triacylglycerol lipase. 


3 


WO 99/55883 


PCT/US99/09280 


SEQ ID NO: 1 6 is the deduced amino acid sequence of a portion of a rice acid 
triacyiglycerol lipase derived from the nucleotide sequence of SEQ ID NO: 1 5. 

SEQ ID NO: 17 is the nucleotide sequence comprising the contig assembled from the 
entire cDNA insert in clone ssl.pk0022.al and a portion of the cDNA insert in clone 
5 sdp3c.pk004.n3 encoding an entire soybean acid triacyiglycerol lipase. 

SEQ ID NO: 18 is the deduced amino acid sequence of an entire soybean acid 
triacyiglycerol lipase derived from the nucleotide sequence of SEQ ID NO: 1 7. 

SEQ ID NO: 19 is the nucleotide sequence comprising the contig assembled from the 
entire cDNA insert in clone sre.pk0004.d7 and a portion of the cDNA insert in clones 
10 slslc.pk009.o2 and srrlc.pk001.ml9 encoding an entire soybean acid triacyiglycerol lipase. 

SEQ ID NO:20 is the deduced amino acid sequence of an entire soybean acid 
triacyiglycerol lipase derived from the nucleotide sequence of SEQ ID NO: 19. 

SEQ ID NO:21 is the nucleotide sequence comprising the entire cDNA insert in clone 
crln.pk0145.c6 encoding half of a corn neutral triacyiglycerol lipase. 
15 SEQ ID NO:22 is the deduced amino acid sequence of half of a corn neutral 

triacyiglycerol lipase derived from the nucleotide sequence of SEQ ID NO:21. 

SEQ ID NO:23 is the nucleotide sequence comprising the contig assembled from a 
portion of the cDNA insert in clones p0010.cbpbe40r, p0083.cldcql7r, p0048.cqlac25r. 
pOl 18xhsbw59r, crl.pkOOl 1x9 and cdolc.pk002.c22 encoding an entire corn neutral 
20 triacyiglycerol lipase. 

SEQ ID NO:24 is the deduced amino acid sequence of an entire corn neutral 
triacyiglycerol lipase derived from the nucleotide sequence of SEQ ID NO:23. 

SEQ ID NO:25 is the nucleotide sequence comprising the contig assembled from the 
entire cDNA insert in clone crln.pk0127.h8 and a portion of the cDNA insert in clones 
25 p0037xrwan02r, p0004.cblfm22r, p0004.cblei43r ? ccoln.pk068.o9 and p0093.cssao39r 
encoding most of a corn neutral triacyiglycerol lipase. 

SEQ ID NO:26 is the deduced amino acid sequence of most of a corn neutral 
triacyiglycerol lipase derived from the nucleotide sequence of SEQ ID NO:25. 

SEQ ID NO:27 is the nucleotide sequence comprising a portion of the cDNA insert in 
30 clone rdrlf.pk002.fl 1 encoding a portion of a rice neutral triacyiglycerol lipase. 

SEQ ID NO:28 is the deduced amino acid sequence of a portion of a rice neutral 
triacyiglycerol lipase derived from the nucleotide sequence of SEQ ID NO:27. 

SEQ ID NO:29 is the nucleotide sequence comprising the contig assembled from the 
entire cDNA insert in clone sre.pk0058.bl and a portion of the cDNA insert in clone 
35 sahlc.pk001.k20 encoding a substantial portion of a soybean neutral triacyiglycerol lipase. 

SEQ ID NO:30 is the deduced amino acid sequence of a substantial portion of a 
soybean neutral triacyiglycerol lipase derived from the nucleotide sequence of SEQ ID 
NO:29. 
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SEQ ID NO:31 is the nucleotide sequence comprising the entire cDNA insert in clon? 
sri .pk0079.el encoding the C-terminal half of a soybean neutral triacyl glycerol lipase. 

SEQ ID NO:32 is the deduced amino acid sequence of the C-terminal half of a 
soybean neutral triacylglycerol lipase derived from the nucleotide sequence of SEQ ID 
5 NO:31. 

SEQ ID NO:33 is the nucleotide sequence comprising the entire cDNA insert in clone 
wrl .pkOl 1 5.f5 encoding a portion of a wheat neutral triacylglycerol lipase. 

SEQ ID NO:34 is the deduced amino acid sequence of a portion of a wheat neutral 
triacylglycerol lipase derived from the nucleotide sequence of SEQ ID NO:33. 

10 SEQ ID NO:35 is the amino acid sequence of a Cams familiaris acid triacylglycerol 

lipase, NCBI General Identifier No. 3041 702. 

SEQ ID NO:36 is the amino acid sequence of a Caenorhabditis elegans acid 
triacylglycerol lipase, NCBI General Identifier No. 3 1 65581 . 

The Sequence Listing contains the one letter code for nucleotide sequence characters 

15 and the three letter codes for amino acids as defined in conformity with the IUPAC-IUBMB 
standards described in Nucleic Acids Research 75:3021-3030 (1985) and in the Biochemical 
Journal 219 (No. 2;:345-373 (1984) which are herein incorporated by reference. The 
symbols and format used for nucleotide and amino acid sequence data comply with the rules 
set forth in 37 C.F.R. §1.822. 

20 DETAILED DESCRIPTION OF THE INVENTION 

In the context of this disclosure, a number of terms shall be utilized. As used herein, 
an "isolated nucleic acid fragment" is a polymer of RNA or DNA that is single- or double- 
stranded, optionally containing synthetic, non-natural or altered nucleotide bases. An 
isolated nucleic acid fragment in the form of a polymer of DNA may be comprised of one or 

25 more segments of cDNA, genomic DNA or synthetic DNA. As used herein. "contig" refers 
to an assemblage of overlapping nucleic acid sequences to form one contiguous nucleotide 
sequence. For example, several DNA sequences can be compared and aligned to identify 
common or overlapping regions. The individual sequences can then be assembled into a 
single contiguous nucleotide sequence. 

30 As used herein, "substantially similar" refers to nucleic acid fragments wherein 

changes in one or more nucleotide bases results in substitution of one or more amino acids, 
but do not affect the functional properties of the protein encoded by the DNA sequence. 
"Substantially similar" also refers to nucleic acid fragments wherein changes in one or more 
nucleotide bases does not affect the ability of the nucleic acid fragment to mediate alteration 

35 of gene expression by antisense or co-suppression technology, "Substantially similar" also 
refers to modifications of the nucleic acid fragments of the instant invention such as deletion 
or insertion of one or more nucleotides that do not substantially affect the functional 
properties of the resulting transcript vis-a-vis the ability to mediate alteration of gene 
expression by antisense or co-suppression technology or alteration of the functional 
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properties of the resulting protein molecule. It is therefore understood that the invention ^ 
encompasses more than the specific exemplary sequences. 

For example, it is well known in the art that antisense suppression and co-suppression 
of gene expression may be accomplished using nucleic acid fragments representing less than 
5 the entire coding region of a gene, and by nucleic acid fragments that do not share 1 00% 
sequence identity with the gene to be suppressed. Moreover, alterations in a gene which 
result in the production of a chemically equivalent amino acid at a given site, but do not 
effect the functional properties of the encoded protein, are well known iisAhe art. Thus, a 
codon for the amino acid alanine, a hydrophobic amino acid, may be substituted by a codon 

10 encoding another less hydrophobic residue, such as glycine, or a more hydrophobic residue, 
such as valine, leucine, or isoleucine. Similarly, changes which result in substitution of one 
negatively charged residue for another, such as aspartic acid for glutamic acid, or one 
positively charged residue for another, such as lysine for arginine, can also be expected to 
produce a functionally equivalent product. Nucleotide changes which result in alteration of 

15 the N-terminal and C-terminal portions of the protein molecule would also not be expected 
to alter the activity of the protein. Each of the proposed modifications is well within the 
routine skill in the art. as is determination of retention of biological activity of the encoded 
products. Moreover, substantially similar nucleic acid fragments may also be characterized 
by their ability to hybridize, under stringent conditions (0.1X SSC, 0.1% SDS, 65°C), with 

20 the nucleic acid fragments disclosed herein. 

Substantially similar nucleic acid fragments of the instant invention may also be 
characterized by the percent similarity of the amino acid sequences that they encode to the 
amino acid sequences disclosed herein, as determined by algorithms commonly employed by 
those skilled in this art. Preferred are those nucleic acid fragments whose nucleotide 

25 sequences encode amino acid sequences that are 80% similar to the amino acid sequences 
reported herein. More preferred nucleic acid fragments encode amino acid sequences that 
are 90% similar to the amino acid sequences reported herein. Most preferred are nucleic 
acid fragments that encode amino acid sequences that are 95% similar to the amino acid 
sequences reported herein. Sequence alignments and percent similarity calculations were 

30 performed using the Megalign program of the LASARGENE bioinformatics computing suite 
(DNASTAR Inc., Madison, Wl). Multiple alignment of the sequences was performed using 
the Clustal method of alignment (Higgins, D. G. and Sharp, P. M. (1989) CABIOS. 
5:151-153) with the default parameters (GAP PENALTY=10, GAP LENGTH 
PENALTY=10). Default parameters for painvise alignments using the Clustal method were 

35 KTUPLE 1 , GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5. 

A "substantial portion" of an amino acid or nucleotide sequence comprises enough of 
the amino acid sequence of a polypeptide or the nucleotide sequence of a gene to afford 
putative identification of that polypeptide or gene, either by manual evaluation of the 
sequence by one skilled in the art, or by computer-automated sequence comparison and 
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identification using algorithms such as BLAST (Basic Local Alignment Search Tool; 
Altschul, S. F., et al. (1993) 1 Mol Biol 275:403-410; see also 

www.ncbi.nlm.nih.gov/BLAST/). In general, a sequence often or more contiguous amino 
acids or thirty or more nucleotides is necessary in order to putatively identify a polypeptide 
5 or nucleic acid sequence as homologous to a known protein or gene. Moreover, with respect 
to nucleotide sequences, gene specific oligonucleotide probes comprising 20-30 contiguous 
nucleotides may be used in sequence-dependent methods of gene identification (e.g., 
Jj8ythern hybridization) and isolation (e.g., in situ hybridization of bacterial colonies or 
bacteriophage plaques). In addition, short oligonucleotides of 12-15 bases may be used as 

10 amplification primers in PCR in order to obtain a particular nucleic acid fragment 

comprising the primers. Accordingly, a "substantial portion" of a nucleotide sequence 
comprises enough of the sequence to afford specific identification and/or isolation of a 
nucleic acid fragment comprising the sequence. The instant specification teaches partial or 
complete amino acid and nucleotide sequences encoding one or more particular plant 

15 proteins. The skilled artisan, having the benefit of the sequences as reported herein, may 
now use all or a substantial portion of the disclosed sequences for purposes known to those 
skilled in this art. Accordingly, the instant invention comprises the complete sequences as 
reported in the accompanying Sequence Listing, as well as substantial portions of those 
sequences as defined above. 

20 "Codon degeneracy" refers to divergence in the genetic code permitting variation of 

the nucleotide sequence without effecting the amino acid sequence of an encoded 
polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment that 
encodes all or a substantial portion of the amino acid sequence encoding the acid or the 
neutral triacylglycerol lipase proteins as set forth in SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 

25 1 8, 20, 22, 24, 26, 28, 30, 32 and 34. The skilled artisan is well aware of the "codon-bias" 
exhibited by a specific host cell in usage of nucleotide codons to specify a given amino acid. 
Therefore, when synthesizing a gene for improved expression in a host cell, it is desirable to 
design the gene such that its frequency of codon usage approaches the frequency of 
preferred codon usage of the host cell. 

30 "Synthetic genes" can be assembled from oligonucleotide building blocks that are 

chemically synthesized using procedures known to those skilled in the art. These building 
blocks are iigated and annealed to form gene segments which are then enzymatically 
assembled to construct the entire gene. "Chemically synthesized", as related to a sequence 
of DNA, means that the component nucleotides were assembled in vitro. Manual chemical 

35 synthesis of DNA may be accomplished using well established procedures, or automated 
chemical synthesis can be performed using one of a number of commercially available 
machines. Accordingly, the genes can be tailored for optimal gene expression based on 
optimization of nucleotide sequence to reflect the codon bias of the host cell. The skilled 
artisan appreciates the likelihood of successful gene expression if codon usage is biased 
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towards those codons favored by the host. Determination of preferred codons can be based^ 
on a survey of genes derived from the host cell where sequence information is available. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, including 
regulatory sequences preceding (5* non-coding sequences) and following (3 r non-coding 
5 sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its 
own regulatory sequences. "Chimeric gene" refers any gene that is not a native gene, 
comprising regulatory and coding sequences that are not found together in nature. 
Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that 
are derived from different sources, or regulatory sequences and coding sequences derived 

10 from the same source, but arranged in a manner different than that found in nature. 
"Endogenous gene" refers to a native gene in its natural location in the genome of an 
organism. A "foreign" gene refers to a gene not normally found in the host organism, but 
that is introduced into the host organism by gene transfer. Foreign genes can comprise 
native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene 

15 that has been introduced into the genome by a transformation procedure. 

"Coding sequence" refers to a DNA sequence that codes for a specific amino acid 
sequence. "Regulatory sequences" refer to nucleotide sequences located upstream (5' non- 
coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence, 
and which influence the transcription, RNA processing or stability, or translation of the 

20 associated coding sequence. Regulatory sequences may include promoters, translation 
leader sequences, introns, and polyadenylation recognition sequences. 

"Promoter" refers to a DNA sequence capable of controlling the expression of a 
coding sequence or functional RNA. In general, a coding sequence is located 3' to a 
promoter sequence. The promoter sequence consists of proximal and more distal upstream 

25 elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" is a 
DNA sequence which can stimulate promoter activity and may be an innate element of the 
promoter or a heterologous element inserted to enhance the level or tissue-specificity of a 
promoter. Promoters may be derived in their entirety from a native gene, or be composed of 
different elements derived from different promoters found in nature, or even comprise 

30 synthetic DNA segments. It is understood by those skilled in the art that different promoters 
may direct the expression of a gene in different tissues or cell types, or at different stages of 
development, or in response to different environmental conditions. Promoters which cause a 
gene to be expressed in most cell types at most times are commonly referred to as 
"constitutive promoters". New promoters of various types useful in plant cells are 

35 constantly being discovered; numerous examples may be found in the compilation by 

Okamuro and Goldberg, (1989) Biochemistry of Plants 75:1-82. It is further recognized that 
since in most cases the exact boundaries of regulatory sequences have not been completely 
defined, DNA fragments of different lengths may have identical promoter activity. 
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The "translation leader sequence" refers to a DNA sequence located between the * 
promoter sequence of a gene and the coding sequence. The translation leader sequence is 
present in the folly processed mRNA upstream of the translation start sequence. The 
translation leader sequence may affect processing of the primary transcript to mRNA, 
mRNA stability or translation efficiency. Examples of translation leader sequences have 
been described (Turner, R. and Foster, G. D. (1995) Molecular Biotechnology J:225). 

The "3' non-coding sequences" refer to DNA sequences located downstream of a 
coding sequence and include polyadenylation recognition sequences and other sequences 
encoding regulatory signals capable of affecting mRNA processing or gene expression. The 
polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid 
tracts to the 3* end of the mRNA precursor. The use of different 3* non-coding sequences is 
exemplified by Ingelbrecht et al. (1989) Plant Cell 7:671-680. 

"RNA transcript" refers to the product resulting from RNA polymerase-catalyzed 
transcription of a DNA sequence. When the RNA transcript is a perfect complementary 
copy of the DNA sequence, it is referred to as the primary transcript or it may be a RNA 
sequence derived from posttranscriptional processing of the primary transcript and is 
referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the RNA that is 
without introns and that can be translated into protein by the cell. "cDNA" refers to a 
double-stranded DNA that is complementary to and derived from mRNA. "Sense" RNA 
refers to RNA transcript that includes the mRNA and so can be translated into protein by the 
cell. "Antisense RNA" refers to a RNA transcript that is complementary to all or part of a 
target primary transcript or mRNA and that blocks the expression of a target gene (U.S. 
Patent No. 5,107,065, incorporated herein by reference). The complementarity of an 
antisense RNA may be with any part of the specific gene transcript, i.e.. at the 5' non-coding 
sequence, 3* non-coding sequence, introns, or the coding sequence. "Functional RNA" 
refers, to sense RNA. antisense RNA, ribozyme RNA, or other RNA thai may not be 
translated but yet has an effect on cellular processes. 

The term "operably linked" refers to the association of nucleic acid sequences on a 
single nucleic acid fragment so that the function of one is affected by the other. For 
example, a promoter is operably linked with a coding sequence when it is capable of 
affecting the expression of that coding sequence (i.e., that the coding sequence is under the 
transcriptional control of the promoter). Coding sequences can be operably linked to 
regulatory sequences in sense or antisense orientation. 

The term "expression", as used herein, refers to the transcription and stable 
accumulation of sense (mRNA) or antisense RNA derived from the nucleic acid fragment of 
the invention. Expression may also refer to translation of mRNA into a polypeptide. 
"Antisense inhibition" refers to the production of antisense RNA transcripts capable of 
suppressing the expression of the target protein. "Overexpression" refers to the production 
of a gene product in transgenic organisms that exceeds levels of production in normal or 
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non-transformed organisms. "Co-suppression" refers to the production of sense RNA ** 
transcripts capable of suppressing the expression of identical or substantially similar foreign 
or endogenous genes (U.S. Patent No. 5.231,020, incorporated herein by reference). 

"Altered levels" refers to the production of gene product(s) in transgenic organisms in 
5 amounts or proportions that differ from that of normal or non-transformed organisms. 

"Mature" protein refers to a post-translationally processed polypeptide; i.e., one from 
which any pre- or propeptides present in the primary translation product have been removed. 
"Precursor" protein refers to the primary product of translation of mRNA; i.e., with pre- and 
propeptides still present. Pre- and propeptides may be but are not limited to intracellular 
10 localization signals. 

A "chloroplast transit peptide" is an amino acid sequence which is translated in 
conjunction with a protein and directs the protein to the chloroplast or other plastid types 
present in the cell in which the protein is made. "Chloroplast transit sequence" refers to a 
nucleotide sequence that encodes a chloroplast transit peptide. A "signal peptide" is an 
15 amino acid sequence which is translated in conjunction with a protein and directs the protein 
to the secretory system (Chrispeels, J. J., (1991) Ann. Rev. Plant Phys. Plant Mol Biol 
42:21-53). If the protein is to be directed to a vacuole, a vacuolar targeting signal {supra) 
can further be added, or if to the endoplasmic reticulum, an endoplasmic reticulum retention 
signal (supra) may be added. If the protein is to be directed to the nucleus, any signal 
20 peptide present should be removed and instead a nuclear localization signal included 
(Raikhel (1992) Plant Phys. 700:1627-1632). 

"Transformation" refers to the transfer of a nucleic acid fragment into the genome of a 
host organism, resulting in genetically stable inheritance. Host organisms containing the 
transformed nucleic acid fragments are referred to as "transgenic" organisms. Examples of 
25 methods of plant transformation include Agrobacterium-mediated transformation (De Blaere 
et al. (1987) Math. Enzymol 143:211) and particle-accelerated or "gene gun" transformation 
technology (Klein T. M. et al. (1987) Nature (London) 327:10-13; U.S. Patent 
No. 4,945,050, incorporated herein by reference). 

Standard recombinant DNA and molecular cloning techniques used herein are well 
30 known in the art and are described more fully in Sambrook, J., Fritsch, E. F. and 

Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory 
Press: Cold Spring Harbor, 1989 (hereinafter "Maniatis"). 

Nucleic acid fragments encoding at least a portion of several triacylglycerol lipases 
have been isolated and identified by comparison of random plant cDNA sequences to public 
35 databases containing nucleotide and protein sequences using the BLAST algorithms well 
known to those skilled in the art. Table 1 lists the proteins that are described herein, and the 
designation of the cDNA clones that comprise the nucleic acid fragments encoding these 
proteins. 
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TABLE 1 



Triacylglycerol Lipases 


Enzyme 

Clone 

Plant 

Triacylglycerol Acid Lipase 

cen3n.pk0129.e9 

Com 


Contig of: 

p0075.cslag33r 
p0126.cnlay46r 
p0014.ctuty54r 

Corn 


p0102.ceral64r 

Corn 


p0126.cnlcm37r 

Corn 


ncs.pk0013.hl 

Catalpa 


rlr72.pk0015.b2 

Rice 


rslln.pk012.h7 

Rice 


Contig of: 

sdp3c.pk004.n3 
ssl.pk0022.al 

Soybean 


Contig of: 

slslc.pk009.o2 

srrlc.pk001.ml9 

sre.pk0004.d7 

Soybean 

Triacylglycerol Neutral Lipase 

crln.pk0145.c6 

Corn 


Contig of: 

p0010.cbpbe40r 

p0083.cldcql7r 

p0048.cqlac25r 

p0118xhsbw59r 

crl.pk0011.c9 

cdolc nk002 c?9 

Corn 


Contig of: 

p0037.crwan02r 

p0004xblfin22r 

p0004.cblei43r 

ccoln.pk068.o9 

n0093 c<5<;ao^9r 

crln.pk0127.h8 

f"*rvm 


rdrlf.pk002.fll 

Rice 


Contig of: 

sahlc.pk00I.k20 
sre.pk0058.bl 

Soybean 


srl.pk0079.el 

Soybean 


wrl.pk0115.f5 

Wheat 


The nucleic acid fragments of the instant invention may be used to isolate cDNAs and 
5 genes encoding homologous proteins from the same or other plant species. Isolation of 
homologous genes using sequence-dependent protocols is well known in the art. Examples 
of sequence-dependent protocols include, but are not limited to, methods of nucleic acid 
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hybridization, and methods of DNA and RNA amplification as exemplified by various uses*" 
of nucleic acid amplification technologies (e.g., polymerase chain reaction, ligase chain 
reaction). 

For example, genes encoding other acid triacylglycerol lipases, either as cDNAs or 
5 genomic DNAs, could be isolated directly by using all or a portion of the instant nucleic acid 
fragments as DNA hybridization probes to screen libraries from any desired plant employing 
methodology well known to those skilled in the art. Specific oligonucleotide probes based 
upon the instant nucleic acid sequences can be designed and synthesized by methods known 
in the art (Maniatis). Moreover, the entire sequences can be used directly to synthesize 

10 DNA probes by methods known to the skilled artisan such as random primer DNA labeling, 
nick translation, or end-labeling techniques, or RNA probes using available in vitro 
transcription systems. In addition, specific primers can be designed and used to amplify a 
part or all of the instant sequences. The resulting amplification products can be labeled 
directly during amplification reactions or labeled after amplification reactions, and used as 

15 probes to isolate full length cDNA or genomic fragments under conditions of appropriate 
stringency. 

In addition, two short segments of the instant nucleic acid fragments may be used in 
polymerase chain reaction protocols to amplify longer nucleic acid fragments encoding 
homologous genes from DNA or RNA. The polymerase chain reaction may also be 

20 performed on a library of cloned nucleic acid fragments wherein the sequence of one primer 
is derived from the instant nucleic acid fragments, and the sequence of the other primer takes 
advantage of the presence of the polyadenylic acid tracts to the 3' end of the mRNA 
precursor encoding plant genes. Alternatively, the second primer sequence may be based 
upon sequences derived from the cloning vector. For example, the skilled artisan can follow 

25 the RACE protocol (Frohman et al. ( 1 988) Proc. Natl Acad. ScL USA 55:8998) to generate 
cDNAs by using PCR to amplify copies of the region between a single point in the 
transcript and the 3' or 5' end. Primers oriented in the 3' and 5' directions can be designed 
from the instant sequences. Using commercially available 3* RACE or 5' RACE systems 
(BRL), specific 3' or 5* cDNA fragments can be isolated (Ohara et al. (1989) Proc. Natl 

30 Acad. ScL USA 86:5673: Loh et al. (1989) Science 243:217). Products generated by the 3' 
and 5 f RACE procedures can be combined to generate full-length cDNAs (Frohman, M. A. 
and Martin, G. R., (1989) Techniques 7:165). 

Availability of the instant nucleotide and deduced amino acid sequences facilitates 
immunological screening of cDNA expression libraries. Synthetic peptides representing 

35 portions of the instant amino acid sequences may be synthesized. These peptides can be 
used to immunize animals to produce polyclonal or monoclonal antibodies with specificity 
for peptides or proteins comprising the amino acid sequences. These antibodies can be then 
be used to screen cDNA expression libraries to isolate full-length cDNA clones of interest 
(Lerner, R. A. (\9&4) Adv. Immunol. 36: 1 ; Maniatis). 
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The nucleic acid fragments of the instant invention may be used to create transgenic * 
plants in which the disclosed acid or neutral triacylglycerol lipases are present at higher or 
lower levels than normal or in cell types or developmental stages in which they are not 
normally found. This would have the effect of altering the level of triacylglycerol and 
5 cholesteryl esters in those cells. Accumulation of fatty acids with unusual structures may be 
a positive phenotype in plants used for foods. Triacylglycerol lipases may also be useful in 
processing of plant seed oils and the development of novel seed oils. 

Overexpression of the acid or the neutral triacylglycerol lipases of the instant 
invention may be accomplished by first constructing a chimeric gene in which the coding 
10 region is operably linked to a promoter capable of directing expression of a gene in the 
desired tissues at the desired stage of development. For reasons of convenience, the 
chimeric gene may comprise promoter sequences and translation leader sequences derived 
from the same genes. 3' Non-coding sequences encoding transcription termination signals 
may also be provided. The instant chimeric gene may also comprise one or more introns in 
15 order to facilitate gene expression. 

Plasmid vectors comprising the instant chimeric gene can then constructed. The 
choice of plasmid vector is dependent upon the method that will be used to transform host 
plants. The skilled artisan is well aware of the genetic elements that must be present on the 
plasmid vector in order to successfully transform, select and propagate host cells containing 
20 the chimeric gene. The skilled artisan will also recognize that different independent 

transformation events will result in different levels and patterns of expression (Jones et al. 
(1985) EMBOJ. 4:24\ 1-2418; De Almeida et al. (1989) Mol Gen. Genetics 275:78-86), and 
thus that multiple events must be screened in order to obtain lines displaying the desired 
expression level and pattern. Such screening may be accomplished by Southern analysis of 
25 DNA, Northern analysis of mRNA expression. Western analysis of protein expression, or 
phenotypic analysis. 

For some applications it may be useful to direct the instant triacylglycerol lipase to 
different cellular compartments, or to facilitate its secretion from the cell. It is thus 
envisioned that the chimeric gene described above may be further supplemented by altering 

30 the coding sequence to encode a acid triacylglycerol lipase with appropriate intracellular 
targeting sequences such as transit sequences (Keegstra, K. (1 989) Cell 56:247-253), signal 
sequences or sequences encoding endoplasmic reticulum localization (Chrispeels, J. J., 
{\99\) Ann. Rev. Plant Phys. Plant Mol. Biol. </2:21-53), or nuclear localization signals 
(Raikhel, N. (1992) Plant Phys. 700:1627-1632) added and/or with targeting sequences that 

35 are already present removed. While the references cited give examples of each of these, the 
list is not exhaustive and more targeting signals of utility may be discovered in the future. 

It may also be desirable to reduce or eliminate expression of genes encoding acid or 
neutral triacylglycerol lipases in plants for some applications. In order to accomplish this, a 
chimeric gene designed for co-suppression of the instant triacylglycerol lipase can be 
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constructed by linking a gene or gene fragment encoding an acid or a neutral triacylglycerol 
lipase to plant promoter sequences. Alternatively, a chimeric gene designed to express 
antisense RNA for all or part of the instant nucleic acid fragment can be constructed by 
linking the gene or gene fragment in reverse orientation to plant promoter sequences. Either 
5 the co-suppression or antisense chimeric genes could be introduced into plants via 

transformation wherein expression of the corresponding endogenous genes are reduced or 
eliminated. 

The instant acid or neutral triacylglycerol lipases (or portions thereof) may be 
produced in heterologous host cells, particularly in the cells of microbial hosts, and can be 

10 used to prepare antibodies to the these proteins by methods well known to those skilled in 
the art. The antibodies are useful for detecting acid or neutral triacylglycerol lipases in situ 
in cells or in vitro in cell extracts. Preferred heterologous host cells for production of the 
instant acid or neutral triacylglycerol lipases are microbial hosts. Microbial expression 
systems and expression vectors containing regulatory sequences that direct high level 

15 expression of foreign proteins are well known to those skilled in the art. Any of these could 
be used to construct a chimeric gene for production of the instant acid or neutral 
triacylglycerol lipase. This chimeric gene could then be introduced into appropriate 
microorganisms via transformation to provide high level expression of the encoded 
triacylglycerol lipase. An example of a vector for high level expression of the instant acid or 

20 neutral triacylglycerol lipase in a bacterial host is provided (Example 7). 

All or a substantial portion of the nucleic acid fragments of the instant invention may 
also be used as probes for genetically and physically mapping the genes that they are a part 
of, and as markers for traits linked to those genes. Such information may be useful in plant 
breeding in order to develop lines with desired phenotypes. For example, the instant nucleic 

25 acid fragments may be used as restriction fragment length polymorphism (RFLP) markers. 
Southern blots (Maniatis) of restriction-digested plant genomic DNA may be probed with 
the nucleic acid fragments of the instant invention. The resulting banding patterns may then 
be subjected to genetic analyses using computer programs such as MapMaker (Lander et al. 
(1987) Genomics 7:174-181) in order to construct a genetic map. In addition, the nucleic 

30 acid fragments of the instant invention may be used to probe Southern blots containing 

restriction endonuclease-treated genomic DNAs of a set of individuals representing parent 
and progeny of a defined genetic cross. Segregation of the DNA polymorphisms is noted 
and used to calculate the position of the instant nucleic acid sequence in the genetic map 
previously obtained using this population (Botstein, D. et al. (1 980) Am. J. Hum. Genet 

35 52:314-331). 

The production and use of plant gene-derived probes for use in genetic mapping is 
described in R. Bernatzky, R. and Tanksley, S. D. (1986) Plant Mol Biol Reporter 
4(1):31-A\. Numerous publications describe genetic mapping of specific cDNA clones 
using the methodology outlined above or variations thereof. For example, F2 intercross 
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populations, backcross populations, randomly mated populations, near isogenic lines, and "* 
other sets of individuals may be used for mapping. Such methodologies are well known to 
those skilled in the art. 

Nucleic acid probes derived from the instant nucleic acid sequences may also be used 
5 for physical mapping (i.e., placement of sequences on physical maps; see Hoheisel, J. D.. 
et al. In: Nonmammalian Genomic Analysis: A Practical Guide, Academic press 1 996, 
pp. 319-346. and references cited therein). 

In another embodiment, nucleic acid probes derived from the instant nucleic acid 
sequences may be used in direct fluorescence in situ hybridization (FISH) mapping (Trask, 

10 B. J. (1991) Trends Genet 7:149-154). Although current methods of FISH mapping favor 
use of large clones (several to several hundred KB; see Laan, M. et al. (1995) Genome 
Research 5:13-20), improvements in sensitivity may allow performance of FISH mapping 
using shorter probes. 

A variety of nucleic acid amplification-based methods of genetic and physical 

15 mapping may be carried out using the instant nucleic acid sequences. Examples include 
allele-specific amplification (Kazazian, H. H. (1989)/ Lab. Clin. Med. 1 M(2):95-96), 
polymorphism of PCR-amplified fragments (CAPS; Sheffield, V. C. et al. (1993) Genomics 
75:325-332), allele-specific ligation (Landegren, U. et al. (1988) Science 247:1077-1080), 
nucleotide extension reactions (Sokolov, B. P. (1990) Nucleic Acid Res, 75:3671), Radiation 

20 Hybrid Mapping (Walter, M. A. et al. (1997) Nature Genetics 7:22-28) and Happy Mapping 
(Dear, P. H. and Cook, P. R. (1989) Nucleic Acid Res. 77:6795-6807). For these methods, 
the sequence of a nucleic acid fragment is used to design and produce primer pairs for use in 
the amplification reaction or in primer extension reactions. The design of such primers is 
well known to those skilled in the art. In methods employing PCR-based genetic mapping. 

25 it may be necessary to identify DNA sequence differences between the parents of the 
mapping cross in the region corresponding to the instant nucleic acid sequence. This, 
however, is generally not necessary for mapping methods. 

Loss of function mutant phenotypes may be identified for the instant cDNA clones 
either by targeted gene disruption protocols or by identifying specific mutants for these 

30 genes contained in a maize population carrying mutations in all possible genes (Ballinger 
and Benzer, (1989) Proc. Natl. Acad Sci USA 56:9402; Koes et al. (1995) Proa Natl Acad. 
Sci USA 92:8149; Bensen et al. (1995) Plant Cell 7:75). The latter approach may be 
accomplished in two ways. First, short segments of the instant nucleic acid fragments may 
be used in polymerase chain reaction protocols in conjunction with a mutation tag sequence 

35 primer on DNAs prepared from a population of plants in which Mutator transposons or some 
other mutation-causing DNA element has been introduced (see Bensen, supra). The 
amplification of a specific DNA fragment with these primers indicates the insertion of the 
mutation tag element in or near the plant gene encoding the acid or the neutral 
triacylglycerol lipase. Alternatively, the instant nucleic acid fragment may be used as a 
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hybridization probe against PCR amplification products gerierafed from tEe mutation 
population using the mutation tag sequence primer in conjunction with an arbitrary genomic 
site primer, such as that for a restriction enzyme site-anchored synthetic adaptor. With 
either method, a plant containing a mutation in the endogenous gene encoding an acid or a 
5 neutral triacylglycerol lipase can be identified and obtained. This mutant plant can then be 
used to determine or confirm the natural function of the acid or the neutral triacylglycerol 
lipase gene product. 

EXAMPLES 

The present invention is further defined in the following Examples, in which all parts 
10 and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be 
understood that these Examples, while indicating preferred embodiments of the invention, 
are given by way of illustration only. From the above discussion and these Examples, one 
skilled in the art can ascertain the essential characteristics of this invention, and without 
departing from the spirit and scope thereof, can make various changes and modifications of 
15 the invention to adapt it to various usages and conditions. 

EXAMPLE 1 

Composition of cDNA Libraries: Isolation and Sequencing of cDNA Clones 
cDNA libraries representing mRNAs from various Catalpa, corn, rice, soybean and 
wheat tissues were prepared. The characteristics of the libraries are described below. 



TABLE 2 



cDNA Libraries from Catalpa, Com, Rice. Soybean and Wheat 

Library 

Tissue 

Clone 

ceo In 

Corn Cob of 67 Day Old Plants Grown in Green House* 

ccoln.pk068.o9 

cdolc 

Corn Ovary (including pedicel and glumes), 5 Days After 

cdolc.pk002.c22 


Silking 

cen3n 

Corn Endosperm 20 Days After Pollination* 

cen3n.pk0129.e9 

crl 

Corn Root From 7 Day Old Seedlings 

crl.pk0011.c9 

crln 

Corn Root From 7 Day Old Seedlings* 

crln.pk0127.h8 
crln.pk0145.c6 

ncs 

Catalpa speciosa Developing Seed 

ncs.pk0013.hl 

p0004 

Corn Immature Ear 

p0004.cblei43r 

pOOlO 

Corn Log Phase Suspension Cells Treated With A23 1 87** 

p0004.cblfm22r 
p00l0.cbpbe40r 

p0014 

Corn Leaves 7 and 8 From 3 Foot-Tall Plant 

p0014.ctuty54r 

p0037 

Corn V5 Stage Roots Infested With Corn Root Worm 

p0037.crwan02r 

p0048 

Corn Embryo (Axis and Scuttelum) One Day After 

p0048.cqlac25r 


Germination 

p0075 

Corn Shoot And Leaf Material From Dark-Grown 

p0075.cslag33r 


7 Day-Old Seedlings 
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Library 

Tissue 

Clone 

p0083 

Corn Whole Kernels 7 Days After Pollination 

p0083.cldcql7r 

p0093 

Corn Stalk And Shank, 2-3 Weeks After Pollen Shed* 

d0093 cssao39r 

p0102 

Corn Parlv Meio<;i^ Ta<v??pl<i* 

p0102.ceral64r 

pOl 18 

Corn Stem Tissue Pooled From the 4 to 5 Internodes * 

p0118xhsbw59r 


Subtending The Tassel At Stages V8-V12, Night 


Harvested* 


p0126 

Corn Leaf Tissue From V8-V10 Stages, Pooled, 

D0126 cnlav46r 


Night-Harvested 

p0126.cnlcm37r 

rdrlf 

Rice Developing Root of 10 Day Old Plant 

rdrlf.pk002.ni 

rlr72 

Rice Leaf 15 Days After Germination, 72 Hours After 

rlr72.pk0015.b2 


Infection of Strain Magaporthe grisea 4360-R-62 


fAVPO VAMHV D^cictant 

\r\ v rvz- 1 f\ivi\j), jvesisiani 


rsiln 

Rice 1 5-Day-Old Seedling* 

rslln.pk012.h7 

sahlc 

Soybean Sprayed With Authority™ Herbicide 

sahlc.pk001.k20 

sdp3c 

Soybean Developing Pods (8-9 mm) 

sdp3c.pk004.n3 

slslc 

soybean intected With Sclerotinia sclerotiorum Mycelium 

slslc.pk009.o2 

srl 

Soybean Root 

srl.pk0079.el 

sre 

Soybean Root Elongation Zone 4 to 5 Days After 

sre.pk0004.d7 


Germination 

sre.pk0058.bl 

srrlc 

Soybean 8-Day-Old Root 

srrlc.pk001.mI9 

ssl 

Soybean Seedling 5-10 Days After Germination 

ssl.pk0022.al 

wrl 

Wheat Root From 7 Day Old Seedling 

wrl.pk0115-.f5 


♦These libraries were normalized essentially as described in U.S. Patent No. 5,482,845 
**A23187 is commercially available from several sources including Calbiochem. 

cDNA libraries were prepared in Uni-ZAP™ XR vectors according to the 
manufacturer's protocol (Stratagene Cloning Systems, La Jolla, CA). Conversion of the 
Uni-ZAP™ XR libraries into plasmid libraries was accomplished according to the protocol 
provided by Stratagene. Upon conversion, cDNA inserts were contained in the plasmid 
vector pBluescript. cDNA inserts from randomly picked bacterial colonies containing 
recombinant pBluescript plasmids were amplified via polymerase chain reaction using 
primers specific for vector sequences flanking the inserted cDNA sequences or plasmid 
DNA was prepared from cultured bacterial cells. Amplified insert DNAs or plasmid DNAs 
were sequenced in dye-primer sequencing reactions to generate partial cDNA sequences 
(expressed sequence tags or "ESTs"; see Adams, M. D. et al. (1991) Science 252:1651). 
The resulting ESTs were analyzed using a Perkin Elmer Model 377 fluorescent sequencer. 
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EXAMPLE 2 ^ 
Identification of cDNA Clones 
ESTs encoding triacylglycerol lipases were identified by conducting BLAST (Basic 
Local Alignment Search Tool; Altschul, S. F., et al. (1993) J. Mol Biol 275:403-410; see 
also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity to sequences contained in the 
BLAST "nr" database (comprising all non-redundant GenBank CDS translations, sequences 
derived from the 3-dimensional structure Brookhaven Protein Data Bank, the last major 
release of the SW1SS-PROT protein sequence database, EMBL, and DDBJ databases). The 
cDNA sequences obtained in Example 1 were analyzed for similarity to all publicly 
available DNA sequences contained in the "nr" database using the BLASTN algorithm 
provided by the National Center for Biotechnology Information (NCBI). The DNA 
sequences were translated in all reading frames and compared for similarity to all publicly 
available protein sequences contained in the "nr ,; database using the BLASTX algorithm 
(Gish. W. and States, D. J. (1993) Nature Genetics 3:266-272) provided by the NCBI. For 
convenience, the P-value (probability) of observing a match of a cDNA sequence to a 
sequence contained in the searched databases merely by chance as calculated by BLAST are 
reported herein as "pLog" values, which represent the negative of the logarithm of the 
reported P-value. Accordingly, the greater the pLog value, the greater the likelihood that the 
cDNA sequence and the BLAST "hit" represent homologous proteins. 

EXAMPLE 3 

Characterization of cDNA Clones Encoding Acid Triacvl glycerol Lipases 
The BLASTX search using the EST sequences from clones cen3n.pk0129.e9 r 
ncs.pk0013.h! , a contig sequence assembled from the EST sequences from clones 
rlr72.pk0015.b2 and rrl.pk0051.fI0, a contig sequence assembled from the EST sequences 
of clones ssl.pk0022.al and srl.pk0098.bl 1, and a contig sequence assembled from the EST 
sequences from clones sre.pk0004.d7 and sre.pk0001.b2 revealed similarity of the proteins 
encoded by the cDNAs and the contigs to acid triacylglycerol lipases from human and rat 
(GenBank Accession Nos. are listed below). The BLAST results for each of these ESTs and 
contigs are shown in Table 3: 
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TABLE 3 

BLAST Results for Clones Encoding Polypeptides Homologous 
to Acid Triacyl glycerol Lipases 


Clone 

Oreanism 


t>LAi i pLog bcore 

cen3n.pk0129.e9 

Human 

X05997 

14.52 

ncs.pk0013.hl 

Rat 

X02309 

14.70 

Contig of 

rlr72.pk0015.b2 
rrl.pk0051.fl0 

Human 

U08464 

16.40 

Contig of 
ssl.pk0022.al 
srl.pk0098.bll 

Rat 

X02309 

15.22 

Contig of 

sre.pk0004.d7 
sre.pk0001.b2 

Human 

X76488 

22.00 


TBLASTN analysis of the proprietary plant EST database indicated that other corn, 
rice and soybean sequences also encoded acid triacylglycerol lipases. The BLASTX search 
using the contig sequences assembled with the EST sequences from clones p0075.cslag33r. 
p0126.cnlay46r and p0014.ctuty54r revealed similarity of the proteins encoded by the 
cDNAs to acid triacylglycerol lipase from Homo sapiens (NCBI General Identifier 
No. 505053). The BLASTX search using the EST sequences from clones p0102.ceral64r 
and using the contig sequences assembled from the entire cDNA insert in clone 
ssl.pk0022.al and the EST sequences from clone sdp3c.pk004.n3 revealed similarity of the 
proteins encoded by the cDNAs to acid triacylglycerol lipase from Cams familiahs (NCBI 
General Identifier No. 3041702). The BLASTX search using the EST sequences from clone 
pOl 26xnlcm37r revealed similarity of the proteins encoded by the cDNAs to Drosophila 
melanogaster (NCBI General Identifier No. 2894442). The BLASTX search using the EST 
sequences from clone rslln.pk012.h7 revealed similarity of the proteins encoded by the 
cDNAs to acid triacylglycerol lipase from Rattus norvegicus (NCBI General Identifier 
No. 126307). The BLAST results for each of these sequences is shown in Table 4: 
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TABLE 4 


BLAST Results for Clones Encoding Polypeptides Homologous 
to Acid Triacylglycerol Lipase 


Clone 

NCBI General Identifier No. 

BLAST pLog Score 

Contig of: 

p0075.cslag33r 
p0126.cnlay46r 
p0014.ctuty54r 

505053 

35.00 

p0102.ceral64r 

3041702 

11.30 

p0126.cnlcm37r 

2894442 

10.40 

rsiln.pk012.h7 

126307 

7.00 


5 The sequence of the entire cDNA insert in clone cen3n.pk0129.e9 was determined and 

is shown in SEQ ID NO: 1; the deduced amino acid sequence of this cDNA is shown in SEQ 
ID NO:2. The amino acid sequence set forth in SEQ ID NO:2 was evaluated by BLASTP, 
yielding a pLog value of 1 5.00 versus the Homo sapiens sequence (NCBI General Identifier 
No. 126306). The sequence of the 3'-terminaI portion from clone ncs.pk0013.hl is shown in 

10 SEQ ID NO:3; the deduced amino acid sequence of this cDNA is shown in SEQ ID NO:4. 
The sequence of the 5 '-terminal portion from clone ncs.pk0013.hl is shown in SEQ ID 
NO:5; the deduced amino acid sequence of this cDNA is shown in SEQ ID NO:6. The 
sequence of the contig assembled from the EST sequences from clones p0075.cslag33r, 
p0126.cnlay46r and p0014.ctuty54r is shown in SEQ ID NO:7, the deduced amino acid 

15 sequence of this cDNA is shown in SEQ ID NO:8. The sequence of a portion of the cDNA 
insert from clone pO 1 02.ceral64r is shown in SEQ ID NO:9; the deduced amino acid 
sequence of this cDNA is shown in SEQ ID NO: 10. The sequence of a portion of the cDNA 
insert from clone p0126.cnlcm37r is shown in SEQ ID NO: 1 1; the deduced amino acid 
sequence of this cDNA is shown in SEQ ID NO: 12. The sequence of the entire cDNA insert 

20 in clone rlr72.pk001 5.b2 was determined and is shown in SEQ ID NO: 1 3; the deduced 
amino acid sequence of this cDNA is shown in SEQ ID NO: 14. The amino acid sequence 
set forth in SEQ ID NO:14 was evaluated by BLASTP, yielding a pLog value of 53.30 
versus the C elegans sequence (NCBI General Identifier No. 3165581). The sequence of a 
portion of the cDNA insert from clone rslln.pk012.h7 is shown in SEQ ID NO: 15; the 

25 deduced amino acid sequence of this cDNA is shown in SEQ ID NO: 1 6. The sequence of 
the entire cDNA insert in clone ssl.pk0022.al was determined and a contig assembled with 
this sequence and the EST sequences from clone sdp3c.pk004.n3. The sequence of this 
contig is shown in SEQ ID NO: 17; the deduced amino acid sequence of this cDNA is shown 
in SEQ ID NO: 1 8. The amino acid sequence set forth in SEQ ID NO: 1 8 was evaluated by 

30 BLASTP, yielding a pLog value of 59.40 versus the C. familiaris sequence (NCBI General 
Identifier No. 3041702). The sequence of the entire cDNA insert in clone sre.pk0004.d7 
was determined and a contig assembled with this sequence and the EST sequences from 
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clones slslc.pk009.o2 and srrlc.pk001.ml9. The sequence of this contig is shown in SEQ ~" 
ID NO: 19; the deduced amino acid sequence of this cDNA is shown in SEQ ID NO:20. The 
amino acid sequence set forth in SEQ ID NO:20 was evaluated by BLASTP, yielding a 
pLog value of 48.70 versus the C elegans sequence (NCBI General Identifier No. 3 1 65581). 
5 Figure 1 presents an alignment of the amino acid sequences set forth in SEQ ID 

NOs:14, 18 and 20 with the Canis familiaris sequence (NCBI General Identifier 
No. 3041702; SEQ ID NO:35) and the Caenorhabditis elegans sequence (NCBI General 
Identifier No. 3165581; SEQ ID NO:36). The data in Table 5 presents a calculation of the 
percent similarity of the amino acid sequences set forth in SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 
10 18 and 20 and the Caenorhabditis elegans sequence. 

TABLE 5 

Percent Similarity of Amino Acid Sequences Deduced From the Nucleotide Sequences of 
cDNA Clones Encoding Polypeptides Homologous 
15 to Acid Triacylgiyceroi Lipase 

Percent Identity to 


Clone 

SEQ ID NO. 

3041702 

3165581 

cen3n.pk0129.e9:fis 

2 

27.1 

22.9 

ncs.pk0013.hl.fisl 

4 

27.4 

21.4 

ncs.pk0013.hl.fis2 

6 

30.6 

29.9 

p0075.cslag33r 
p0126.cnlay46r 
p0014.ctuty54r 

8 

22.0 

23.1 

p0102.ceral64r 

10 

28.8 

22.4 

p0126.cnlcm37r 

12 

26.7 

22.2 

rlr72.pk0015.b2:fis 

14 

24.9 

25.6 

rslln.pk012.h7 

16 

22.5 

22.5 

sdp3c.pk004.n3 
ssl.pk0022.al.fisl 

18 

27.4 

23.1 

slslc.pk009.o2 

srrlc.pk001.ml9 

sre.pk0004.d7.fisl 

20 

23.1 

24.8 


Sequence alignments and percent similarity calculations were performed using the Megalign 
program of the LASARGENE bioinformatics computing suite (DNASTAR Inc., Madison, 
WI). Multiple alignment of the amino acid sequences was performed using the Ciustal 
method of alignment (Higgins, D. G. and Sharp, P. M. (1989) CABIOS. 5:151-153) with the 
default parameters (GAP PENALTY=10, GAP LENGTH PEN ALT Y= 10). 

Sequence alignments and BLAST scores and probabilities indicate that the instant 
nucleic acid fragments encode an entire rice acid triacylgiyceroi lipase, two different entire 
soybean acid triacylgiyceroi lipases, portions from several different corn acid triacylgiyceroi 
lipases, portions of a Catalpa acid triacylgiyceroi lipase and a portion of a rice acid 
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triacylglycerol lipase. These sequences represent the first plant sequences encoding acid 
triacylglycerol lipases. 

EXAMPLE 4 

Characterization of cDNA Clones Encoding Neutral Triacylglycerol Lipases 
5 The BLASTX search using the contig sequence assembled from the EST sequences 

from clones crln.pk0127.h8 and crln.pk0134.d3, and EST sequences from clones 
crln.pk0145.c6, sl.03b01, se3.01a04, sfll.pk0049.dl 1, srl.pk0079.el, srl.pk0030.g8, 
sre.pk0058.bl, wlln.pk0014.el0, wlln.pk0038.e3 and wrl.pk0115.f5 revealed similarity of 
the proteins encoded by the contig and the cDNAs to neutral triacylglycerol lipases from 
10 several organisms. Table 6 shows the BLAST results for the contig and each of the ESTs, 
the NCBI database accession number, and the organism the closest art sequence is derived 
from: 

TABLE 6 

15 BLAST Results for Clones Encoding Polypeptides Homologous 


to Neutral Triacylglycerol Lipases 


Clone 

Organism 

NCBI 
Accession No. 

BLAST 
pLop Score 

Contig of: 

crln.pk0127.h8 
crln.pk0134.d3 

Thermomyces lanuginosus 

999873 

10.00 

crln.pk0145.c6 

Caenorhabditis elegans 

927399 

8.70 

srl.pk0079.el 

Rhizopus niveus 

251079 

6.70 

sre.pk0058.bl 

Rhizomucor miehei 

417256 

8.10 

wrl.pk0115.f5 

Rhizomucor miehei 

82777 

6.00 


TBLASTN analysis of the proprietary plant EST database indicated that rice clones as 
well as other corn and soybean clones also encode neutral triacylglycerol lipases. The 

20 BLASTX search using the contig sequences assembled from clones p0010.cbpbe40r ? 

p0083.cldcq!7r, p0048.cqlac25r, pOI 18xhsbw59r, crl.pkOOl 1x9 and cdolc.pk002.c22 and 
using the EST sequences from clone rdrlf.pk002.fl 1 revealed similarity of the proteins 
encoded by the cDNAs to neutral triacylglycerol lipase from C elegans (NCBI General 
Identifier No.3 877256). The BLAST results for each of these sequences are shown in 

25 Table 7: 
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TABLE 7 


BLAST Results for Clones Encoding Polypeptides Homologous 
to Neutral Triacylglycerol Lipases 


Clone 

Organism 

NCBI General 
Identifier No. 

BLAST pLog 
Score 

crln.pk0145.c6 

Caenorhabditis elegans 

3877256 

9.30 

Contig of: 

p0010xbpbe40r 

nOOJH r1dml7r 

UUV/O J ■ vlULU J / i 

p0048xqlac25r 
p0118xhsbw59r 
crl.pk0011x9 
cdolc.pk002x22 

Caenorhabditis elegans 

3877256 

18.40 

Contig of: 

puuj /.crwanu^dr 

p0004.cblfm22r 

p0004.cblei43r 

ccoln.pk068.o9 

p0093.cssao39r 

crln.pk0127.h8 

Thermomyces lamiginosns 

2997733 

6.15 

rdrlf.pk002.fll 

Caenorhabditis elegans 

3877256 

10.22 

Contig of: 

sahlc.pk001.k20 
sre.pk0058.bl 

Rhizomucor miehei 

417256 

6.22 

srl.pk0079.el 

Rhizopus niveus 

3299795 

5.70 

wrl.pk0115.f5 

Caenorhabditis elegans 

3877256 

14.00 


5 The sequence of the entire cDNA insert in clone crln.pkO 145x6 was determined and 

is shown in SEQ ID NO:21; the deduced amino acid sequence of this cDNA is shown in 
SEQ ID NO:22. The amino acid sequence set forth in SEQ ID NO:2 was evaluated by 
BLASTP, yielding a pLog value of 10.70 versus the C. elegans sequence. The sequence of 
the contig assembled from a portion of the cDNA insert in clones p0010xbpbe40r r 

10 p0083xldcql 7r, p0048xqlac25r ; pOl 18xhsbw59r, crl .pkOOl 1 x9 and cdolc.pk002x22 is 
shown in SEQ ID NO:23; the deduced amino acid sequence of this cDNA is shown in SEQ 
ID NO:23. The sequence of the entire cDNA insert in clone crln.pk0127.h8 was determined 
and a contig assembled with this sequence and the sequence from a portion of the cDNA 
insert in clones p0037xrwan02r, p0004xblfm22r, p0004xblei43r, ccoln.pk068.o9 and 

15 p0093xssao39r. The sequence of this contig is shown in SEQ ID NO:25; the deduced 

amino acid sequence of this cDNA is shown in SEQ ID NO:26. The amino acid sequence 
set forth in SEQ ID NO:4 was evaluated by BLASTP, yielding a pLog value of 9.70 versus 
the Thermomyces lanuginosus sequence. The sequence of a portion of the cDNA insert 
from clone rdrlf.pk002.fl 1 is shown in SEQ ID NO:27; the deduced amino acid sequence of 

20 this cDNA is shown in SEQ ID NO:28. The sequence of the entire cDNA insert in clone 
sre.pk0058.bl was determined and a contig assembled with this sequence and the sequence 
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of a portion of the cDNA insert in clone sahlc.pk001.k20. The sequence of this contig is 
shown in SEQ ID NO:29; the deduced amino acid sequence of this cDNA is shown in SEQ 
ID NO:30. The amino acid sequence set forth in SEQ ID NO:30 was evaluated by 
BLASTP, yielding a pLog value of 8.05 versus the Rhizomucor miehei sequence. The 
5 sequence of the entire cDNA insert in clone sr 1 .pk0079.el was determined and is shown in 
SEQ ID NO:3 1 ; the deduced amino acid sequence of this cDNA is shown in SEQ ID NO:32. 
The amino acid sequence set forth in SEQ ID NO:32 was evaluated by BLASTP, yielding a 
pLog value of 7.52 versus the Rhizopus niveus sequence. The sequence of the entire cDNA 
insert in clone wrl .pkOl 1 5.f5 was determined and is shown in SEQ ID NO:33; the deduced 

10 amino acid sequence of this cDNA is shown in SEQ ID NO;34. The amino acid sequence 
set forth in SEQ ID NO:34 was evaluated by BLASTP, yielding a pLog value of 1 3.52 
versus the Caenorhabditis elegans sequence. 

The data in Table 8 presents a calculation of the percent similarity of the amino acid 
sequences set forth in SEQ ID NOs:22, 24, 26, 28, 30, 32 and 34 and the Caenorhabditis 

15 elegans^ Rhizomucor miehei and Thermomyces lanuginosus sequences. 

TABLE 8 

Percent Similarity of Amino Acid Sequences Deduced From the Nucleotide Sequences 
of cDNA Clones Encoding Polypeptides Homologous 
20 to Neutral Triacylglycerol Lipase 





Percent Similarity to 


Clone 

SEQ ID NO. 

3877256 

2997733 

417256 

crln.pk0145.c6 

22 

15.1 

13.2 

16.8 

Contig of: 

p0010.cbpbe40r 

p0083.cldcql7r 

p0048.cqlac25r 

p0118.chsbw59r 

crl.pk0011.c9 

cdolc.pk002.c22 

24 

60.5 

17.5 

22.9 

Contig of: 

p0037.crwan02r 

p0004.cblfm22r 

p0004.cblei43r 

ccoln.pk068.o9 

p0093.cssao39r 

crln.pk0127.h8 

26 

18.5 

14.3 

15.1 

rdrlf.pk002.fll 

28 

12.6 

20.6 

22.9 

Contig of: 

sahlc.pk001.k20 
sre.pk0058.bl 

32 

15.1 

10.5 

17.0 

srl.pk0079.el 

34 

14.3 

21.1 

24.6 

wrl.pk0115.fS 

36 

37.0 

22.0 

26.0 
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Sequence alignments and percent similarity calculations were performed using trie MegaJign 
program of the LASARGENE bioinformatics computing suite (DNASTAR Inc., Madison, 
WI). Multiple alignment of the amino acid sequences was performed using the Clustal 
method of alignment (Higgins, D. G. and Sharp, P. M. (1989) CABIOS. 5:151-153) with the 
5 default parameters (GAP PENALTY^] 0, GAP LENGTH PENALTY=10). 

Sequence alignments and BLAST scores and probabilities indicate that the instant 
nucleic acid fragments encode three different corn neutral triacylglycerol lipases(one portion 
and two entire or nearly entire), two different soybean triacylglycerol lipases (one portion 
and one nearly entire) and a portion of a wheat triacylglycerol lipase. These sequences 

10 represent the first monocot and soybean sequences encoding neutral triacylglycerol lipases. 

EXAMPLE 5 
Expression of Chimeric Genes in Monocot Cells 
A chimeric gene comprising a cDNA encoding triacylglycerol lipases in sense 
orientation with respect to the maize 27 kD zein promoter that is located 5' to the cDNA 

15 fragment, and the 10 kD zein 3' end that is located 3' to the cDNA fragment, can be 
constructed. The cDNA fragment of this gene may be generated by polymerase chain 
reaction (PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites 
(Nco 1 or Sma I) can be incorporated into the oligonucleotides to provide proper orientation 
of the DNA fragment when inserted into the digested vector pML103 as described below. 

20 Amplification is then performed in a standard PCR. The amplified DNA is then digested 
with restriction enzymes Nco I and Smal and fractionated on an agarose gel. The 
appropriate band can be isolated from the gel and combined with a 4.9 kb Nco I-Sma 1 
fragment of the plasmid pML103. Plasmid pML103 has been deposited under the terms of 
the Budapest Treaty at ATCC (American Type Culture Collection, 10801 University Blvd., 

25 Manassas, VA 201 10-2209), and bears accession number ATCC 97366. The DNA segment 
from pMLl 03 contains a 1 .05 kb Sal I-Nco I promoter fragment of the maize 27 kD zein 
gene and a 0.96 kb Sma 1-Sal I fragment from the 3' end of the maize 10 kD zein gene in the 
vector pGem9Zf(+) (Promega). Vector and insert DNA can be ligated at 1 5°C overnight, 
essentially as described (Maniatis). The ligated DNA may then be used to transform E. coli 

30 XLl-Blue (Epicurian Coli XL-1 Blue™; Stratagene). Bacterial transformants can be 

screened by restriction enzyme digestion of plasmid DNA and limited nucleotide sequence 
analysis using the dideoxy chain termination method (Sequenase™ DNA Sequencing Kit; 
U.S. Biochemical). The resulting plasmid construct would comprise a chimeric gene 
encoding, in the 5* to 3' direction, the maize 27 kD zein promoter, a cDNA fragment 

35 encoding a triacylglycerol lipase, and the 10 kD zein 3' region. 

The chimeric gene described above can then be introduced into com cells by the 
following procedure. Immature corn embryos can be dissected from developing caryopses 
derived from crosses of the inbred corn lines H99 and LHI32. The embryos are isolated 10 
to 1 1 days after pollination when they are 1 .0 to 1 .5 mm long. The embryos are then placed 
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with the axis-side facing down and in contact with agarose-solidified N6 medium (Chu et al. 
(1975) ScL Sin. Peking 18:659-668). The embryos are kept in the dark at 27°C. Friable 
embryogenic callus consisting of undifferentiated masses of cells with somatic 
proembryoids and embryoids bome on suspensor structures proliferates from the scutellum 
5 of these immature embryos. The embryogenic callus isolated from the primary explant can 
be cultured on N6 medium and sub-cultured on this medium every 2 to 3 weeks. 

The plasmid, p35S/Ac (obtained from Dr. Peter Eckes, Hoechst Ag, Frankfurt, 
Germany) may be used in transformation experiments in order to provide for a selectable 
marker. This plasmid contains the Pat gene (see European Patent Publication 0 242 236) 

10 which encodes phosphinothricin acetyl transferase (PAT). The enzyme PAT confers 

resistance to herbicidal glutamine synthetase inhibitors such as phosphinothricin. The pat 
gene in p35S/Ac is under the control of the 35S promoter from Cauliflower Mosaic Virus 
(Odell et al. (1985) Nature 313:810-812) and the 3' region of the nopaline synthase gene 
from the T-DN A of the Ti plasmid of Agrobacterium tumefaciens. 

15 The particle bombardment method (Klein. T. M. et al. ( 1 987) Nature 327:70-73) may 

be used to transfer genes to the callus culture cells. According to this method, gold particles 
(1 urn in diameter) are coated with DNA using the following technique. Ten \xg of plasmid 
DNAs are added to 50 \xL of a suspension of gold particles (60 mg per mL). Calcium 
chloride (50 |iL of a 2.5 M solution) and spermidine free base (20 of a 1.0 M solution) 

20 are added to the particles. The suspension is vortexed during the addition of these solutions. 
After 10 minutes, the tubes are briefly centrifuged (5 sec at 1 5,000 rpm) and the supernatant 
removed. The particles are resuspended in 200 uL of absolute ethanol, centrifuged again 
and the supernatant removed. The ethanol rinse is performed again and the particles 
resuspended in a final volume of 30 \xL of ethanol. An aliquot (5 |iL) of the DNA-coated 

25 gold particles can be placed in the center of a Kapton™ flying disc (Bio-Rad Labs). The 
particles are then accelerated into the corn tissue with a Biolistic™ PDS-1000/He (Bio-Rad 
Instruments, Hercules CA), using a helium pressure of 1000 psi, a gap distance of 0.5 cm 
and a flying distance of 1 .0 cm. 

For bombardment, the embryogenic tissue is placed on filter paper over agarose- 

30 solidified N6 medium. The tissue is arranged as a thin lawn and covered a circular area of 
about 5 cm in diameter. The petri dish containing the tissue can be placed in the chamber of 
the PDS-1000/He approximately 8 cm from the stopping screen. The air in the chamber is 
then evacuated to a vacuum of 28 inches of Hg. The macrocarrier is accelerated with a 
helium shock wave using a rupture membrane that bursts when the He pressure in the shock 

35 tube reaches 1 000 psi. 

Seven days after bombardment the tissue can be transferred to N6 medium that 
contains gluphosinate (2 mg per liter) and lacks casein or proline. The tissue continues to 
grow slowly on this medium. After an additional 2 weeks the tissue can be transferred to 
fresh N6 medium containing gluphosinate. After 6 weeks, areas of about 1 cm in diameter 
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of actively growing callus can be identified on some of the plates containing the glufosinate- 
supplemented medium. These calli may continue to grow when sub-cultured on the 
selective medium. 

Plants can be regenerated from the transgenic callus by first transferring clusters of 
tissue to N6 medium supplemented with 0.2 mg per liter of 2,4-D. After two weeks the 
tissue can be transferred to regeneration medium (Fromm et al. (1 990) Bio/Technology 
5:833-839). 

EXAMPLE 6 
Expression of Chimeric Genes in Dicot Cells 

A seed-specific expression cassette composed of the promoter and transcription 
terminator from the gene encoding the P subunit of the seed storage protein phaseolin from 
the bean Phaseolus vulgaris (Doyle et al. (1986) J. BioL Chem. 261 :9228-9238) can be used 
for expression of the instant triacylglycerol lipase in transformed soybean. The phaseolin 
cassette includes about 500 nucleotides upstream (5') from the translation initiation codon 
and about 1650 nucleotides downstream (3') from the translation stop codon of phaseolin. 
Between the 5' and 3' regions are the unique restriction endonuclease sites Nco I (which 
includes the ATG translation initiation codon), Sma L Kpn I and Xba I. The entire cassette 
is flanked by Hind III sites. 

The cDNA fragment of this gene may be generated by polymerase chain reaction 
(PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites can be 
incorporated into the oligonucleotides to provide proper orientation of the DN A fragment 
when inserted into the expression vector. Amplification is then performed as described 
above, and the isolated fragment is inserted into a pUCl 8 vector carrying the seed 
expression cassette. 

Soybean embroys may then be transformed with the expression vector comprising 
sequences encoding a triacylglycerol lipase. To induce somatic embryos, cotyledons. 
3-5 mm in length dissected from surface sterilized, immature seeds of the soybean cultivar 
A2872, can be cultured in the light or dark al 26°C on an appropriate agar medium for 
6-10 weeks. Somatic embryos which produce secondary embryos are then excised and 
placed into a suitable liquid medium. After repeated selection for clusters of somatic 
embryos which multiplied as early, globular staged embryos, the suspensions are maintained 
as described below. 

Soybean embryogenic suspension cultures can maintained in 35 mL liquid media on a 
rotary shaker. 1 50 rpm, at 26°C with florescent lights on a 1 6:8 hour day/night schedule. 
Cultures are subcultured every two weeks by inoculating approximately 35 mg of tissue into 
35 mL of liquid medium. 

Soybean embryogenic suspension cultures may then be transformed by the method of 
particle gun bombardment (Klein T. M. et al. (1987) Nature (London) J27;70-73, U.S. 
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Patent No. 4,945,050). A DuPont Biolistic™ PDS1 000/HE instrument (helium retrofit) can 
be used for these transformations. 

A selectable marker gene which can be used to facilitate soybean transformation is a 
chimeric gene composed of the 35S promoter from Cauliflower Mosaic Virus (Odell et al. 
5 (1985) Nature 575:810-812), the hygromycin phosphotransferase gene from plasmid pJR225 
(from E, coin Gritzet al. (1983) Gene 25:179-188) and the 3' region of the nopaline synthase 
gene from the T-DNA of the Ti plasmid of Agrobacierium tumefaciens. The seed expression 
cassette comprising the phaseolin 5' region, the fragment encoding the triacylglycerol lipase 
and the phaseolin 3' region can be isolated as a restriction fragment. This fragment can then 

10 be inserted into a unique restriction site of the vector carrying the marker gene. 

To 50 \xL of a 60 mg/mL 1 |im gold particle suspension is added (in order): 5 \xh 
DNA (1 ^g/^L), 20 ^1 spermidine (0.1 M), and 50 f*L CaCl 2 (2.5 M). The particle 
preparation is then agitated for three minutes, spun in a micro fuge for 10 seconds and the 
supernatant removed. The DNA-coated particles are then washed once in 400 (iL 70% 

15 ethanol and resuspended in 40 of anhydrous ethanol. The DNA/particle suspension can 
be sonicated three times for one second each. Five fiL of the DNA-coated gold particles are 
then loaded on each macro carrier disk. 

Approximately 300-400 mg of a two-week-old suspension culture is placed in an 
empty 60x1 5 mm petri dish and the residual liquid removed from the tissue with a pipette. 

20 For each transformation experiment, approximately 5-10 plates of tissue are normally 

bombarded. Membrane rupture pressure is set at 1 100 psi and the chamber is evacuated to a 
vacuum of 28 inches mercury. The tissue is placed approximately 3.5 inches away from the 
retaining screen and bombarded three times. Following bombardment, the tissue can be 
divided in half and placed back into liquid and cultured as described above. 

25 Five to seven days post bombardment, the liquid media may be exchanged with fresh 

media, and eleven to twelve days post bombardment with fresh media containing 50 mg/mL 
hygromycin. This selective media can be refreshed weekly. Seven to eight weeks post 
bombardment, green, transformed tissue may be observed growing from untransformed, 
necrotic embryogenic clusters. Isolated green tissue is removed and inoculated into 

30 individual flasks to generate new, clonally propagated, transformed embryogenic suspension 
cultures. Each new line may be treated as an independent transformation event. These 
suspensions can then be subcultured and maintained as clusters of immature embryos or 
regenerated into whole plants by maturation and germination of individual somatic embryos. 

EXAMPLE 7 

35 Expression of Chimeric Genes in Microbial Cells 

The cDNAs encoding the instant triacylglycerol lipases can be inserted into the T7 
E. coli expression vector pBT430. This vector is a derivative of pET-3a (Rosenberg et al. 
(1987) Gene 56:125-135) which employs the bacteriophage T7 RNA polymerase/T7 
promoter system. Plasmid pBT430 was constructed by first destroying the EcoR I and 
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Hind III sites in pET-3a at their original positions. An oligonucleotide adaptor containing 
EcoR I and Hind III sites was inserted at the BamH I site of pET-3a. This created pET-3aM 
with additional unique cloning sites for insertion of genes into the expression vector. Then, 
the Nde I site at the position of translation initiation was converted to an Nco I site using 
5 oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this region, 
5-CATATGG, was converted to 5-CCCATGG in pBT430. 

Plasmid DNA containing a cDNA may be appropriately digested to release a nucleic 
acid fragment encoding the protein. This fragment may then be purified on a 1% NuSieve 
GTG™ low melting agarose gel (FMC). Buffer and agarose contain 10 ^ig/ml ethidium 

10 bromide for visualization of the DNA fragment. The fragment can then be purified from the 
agarose gel by digestion with GELase™ (Epicentre Technologies) according to the 
manufacturer's instructions, ethanol precipitated, dried and resuspended in 20 uL of water. 
Appropriate oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase 
(New England Biolabs, Beverly, MA). The fragment containing the ligated adapters can be 

15 purified from the excess adapters using low melting agarose as described above. The vector 
pBT430 is digested, dephosphorylated with alkaline phosphatase (NEB) and deproteinized 
with phenol/chloroform as described above. The prepared vector pBT430 and fragment can 
then be ligated at 1 6°C for 1 5 hours followed by transformation into DH5 electrocompetent 
cells (GIBCO BRL). Transformants can be selected on agar plates containing LB media and 

20 100 jig/mL ampicillin. Transformants containing the gene encoding the triacylglycerol 
lipase are then screened for the correct orientation with respect to the T7 promoter by 
restriction enzyme analysis. 

For high level expression, a plasmid clone with the cDNA insert in the correct 
orientation relative to the T7 promoter can be transformed into £ coli strain BL21(DE3) 

25 (Studier et al. (1 986) J. Mol Biol 189: 1 1 3-1 30). Cultures are grown in LB medium 

containing ampicillin (100 mg/L) at 25°C. At an optical density at 600 nm of approximately 
1, IPTG (isopropylthio-P-galactoside, the inducer) can be added to a final concentration of 
0.4 mM and incubation can be continued for 3 h at 25°. Cells are then harvested by 
centrifugation and re-suspended in 50 \iL of 50 mM Tris-HCl at pH 8.0 containing 0.1 mM 

30 DTT and 0.2 mM phenyl methylsulfonyl fluoride. A small amount of 1 mm glass beads can 
be added and the mixture sonicated 3 times for about 5 seconds each time with a microprobe 
sonicator. The mixture is centrifuged and the protein concentration of the supernatant 
determined. One u.g of protein from the soluble fraction of the culture can be separated by 
SDS-polyacrylamide gel electrophoresis. Gels can be observed for protein bands migrating 

35 at the expected molecular weight. 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid fragment encoding all or a substantial portion of an 
acid triacylglycerol lipase comprising a member selected from the group consisting of: 

5 (a) an isolated nucleic acid fragment encoding all or a substantial portion of 

the amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 
NO:8, SEQ ID NO:10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID 
NO: 16, SEQ ID NO: 18 and SEQ ID NO:20; 

10 (b) an isolated nucleic acid fragment that is substantially similar to an 

isolated nucleic acid fragment encoding all or a substantial portion of 
the amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6 ? SEQ ID 
NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID 

15 NO:16, SEQ ID NO:18 and SEQ ID NO:20; and 

(c) an isolated nucleic acid fragment that is complementary to (a) or (b). 

2. The isolated nucleic acid fragment of Claim 1 wherein the nucleotide sequence 
of the fragment comprises all or a portion of the sequence set forth in a member selected 
from the group consisting of SEQ ID NO:l , SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, 

20 SEQ ID NO:9, SEQ ID NO: 1 1 , SEQ ID NO: 1 3, SEQ ID NO: 1 5, SEQ ID NO: 1 7 and SEQ 
ID NO: 19. 

3. A chimeric gene comprising the nucleic acid fragment of Claim 1 operably 
linked to suitable regulatory sequences. 

4. A transformed host cell comprising the chimeric gene of Claim 3. 

25 5. An acid triacylglycerol lipase polypeptide comprising all or a substantial 

portion of the amino acid sequence set forth in a member selected from the group consisting 
of SEQ ID NO;2 ? SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID 
NO: 1 2, SEQ ID NO: 1 4, SEQ ID NO: 1 6, SEQ ID NO: 18 and SEQ ID NO:20. 

6. An isolated nucleic acid fragment encoding all or a substantial portion of a 
30 neutral triacylglycerol lipase comprising a member selected from the group consisting of: 

(a) an isolated nucleic acid fragment encoding all or a substantial portion of 
the amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26 ? SEQ ID 
NO:28, SEQ ID NO:30, SEQ ID NO:32 and SEQ ID NO:34; 
35 (b) an isolated nucleic acid fragment that is substantially similar to an 

isolated nucleic acid fragment encoding all or a substantial portion of 
the amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, SEQ ID NO:32 and SEQ ID NO:34; and 
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(c) an isolated nucleic acid fragment that is complementary to (a) or (b). 

7. The isolated nucleic acid fragment of Claim 6 wherein the nucleotide sequence 
of the fragment comprises all or a portion of the sequence set forth in a member selected 
from the group consisting of SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID 

5 NO:27, SEQ ID NO:29, SEQ ID NO:3 1 and SEQ ID NO:33. 

8. A chimeric gene comprising the nucleic acid fragment of Claim 6 operably 
linked to suitable regulatory sequences. 

9. A transformed host cell comprising the chimeric gene of Claim 8. 

1 0. A neutral triacylglycerol lipase polypeptide comprising all or a substantial 

10 portion of the amino acid sequence set forth in a member selected from the group consisting 
of SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ 
ID NO:32 and SEQ ID NO:34. 

11. A method of altering the level of expression of a triacylglycerol lipase in a host 
cell comprising: 

1 5 (a) transforming a host cell with the chimeric gene of any of Claims 3 and 

8; and 

(b) growing the transformed host cell produced in step (a) under conditions 
that are suitable for expression of the chimeric gene 

wherein expression of the chimeric gene results in production of altered levels of a 
20 triacylglycerol lipase in the transformed host cell. 

12. A method of obtaining a nucleic acid fragment encoding all or a substantial 
portion of the amino acid sequence encoding a triacylglycerol lipase comprising: 

(a) probing a cDNA or genomic library with the nucleic acid fragment of 
any of Claims I and 6; 

25 (b) identifying a DNA clone that hybridizes with the nucleic acid fragment 

of any of Claims 1 and 6; 

(c) isolating the DNA clone identified in step (b); and 

(d) sequencing the cDNA or genomic fragment that comprises the clone 
isolated in step (c) 

30 wherein the sequenced nucleic acid fragment encodes all or a substantial portion of the 
amino acid sequence encoding a triacylglycerol lipase. 

13. A method of obtaining a nucleic acid fragment encoding a substantial portion 
of an amino acid sequence encoding a triacylglycerol lipase comprising: 

(a) synthesizing an oligonucleotide primer corresponding to a portion of 
35 the sequence set forth in any of SEQ ID NOs:l, 3, 5, 7, 9, 1 1, 13, 15, 

17, 19, 21, 23, 25, 27, 29, 31 and 33: and 

(b) amplifying a cDNA insert present in a cloning vector using the 
oligonucleotide primer of step (a) and a primer representing sequences 
of the cloning vector 
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wherein the amplified nucleic acid fragment encodes a substantial portion of an amino acid 
sequence encoding a triacylglycerol lipase. 

14. The product of the method of Claim 1 2. 

15. The product of the method of Claim 13. 
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SEQUENCE LISTING 

<110> E. I. DU PONT DE NEMOURS AND COMPANY 

<120> TR I AC YLGLYCEROL LIPASES 

<130> BB-1168 

<140> 
<141> 


<150> 
<151> 


60/083, 688 
1988-04-30 


<160> 36 


<170> Microsoft Office 97 


<210> 
<211> 
<212> 
<213> 


1 

751 
DNA 

Zea mays 


<400> 1 

gcacgagatc 

gtacgagccc 

cggcgtgctg 

ggccgagccg 

ctacggcggc 

ccggggccac 

cttcatcatc 

ccgcttcaac 

aggattagac 

cagaggtgaa 

tcagtatgga 

attgtcacac 

ct tgagttaa 


accggcaaga 
cagccgacct 
accaagtacg 
ccggtgtacc 
cgggactcgc 
gaccaggaca 
ggcgtctgcg 
tagtactagc 
aaaaaaaggg 
aaccatacat 
ggattgtcaa 
tgtgtgtgtt 
aaaaaaaaaa 


actactgcct 
ccaccaaaac 
actacgtgct 
ggatgtccgg 
tcgccgaccc 
agctcacggt 
ccaaggacta 
atatatattt 
ggggacactg 
gatgtaattt 
ctactctcca 
gcaaatttgc 
aaaaaaaaaa 


caacagctcc 
catggtccac 
gccggagcgg 
catcccgccg 
cgccgacgtg 
gcagtacctg 
cgtctacaag 
gcttcaatcg 
cagctcgtaa 
agcattagat 
tcacagcagt 
aacacagtga 
a 


gccgtcgacg 
ttcgctcaaa 
aacatcgcca 
agcttcccgc 
cgcctcctcc 
gacaagttcg 
gacatgatcg 
gtgtcgtctt 
acgttgtcca 
agttaaaaca 
aggtgtgatg 
ttactaatat 


tcttcctcaa 60 
ccgtgcgcga 120 
gctacggcca 180 
tcttcctcag 240 
tgcaggacct 300 
cgcacctcga 360 
acttcctaaa 420 
cagccccagc 4 80 
tacagattat 540 
tggagctgcc 600 
tagaagagtg 660 
aaaaaatact 720 
751 


<210> 2 

<211> 143 

<212> PRT 

<213> Zea mays 

<400> 2 

His Glu He Thr Gly Lys Asn Tyr Cys Leu Asn Ser Ser Ala Val Asp 
15 10 15 

Val Phe Leu Lys Tyr Glu Pro Gin Pro Thr Ser Thr Lys Thr Met Val 
20 25 30 

His Phe Ala Gin Thr Val Arg Asp Gly Val Leu Thr Lys Tyr Asp Tyr 
35 40 45 

Val Leu Pro Glu Arg Asn He Ala Ser Tyr Gly Gin Ala Glu Pro Pro 
50 55 60 

Val Tyr Arg Met Ser Gly He Pro Pro Ser Phe Pro Leu Phe Leu Ser 
65 70 75 80 


Tyr Gly Gly Arg Asp Ser Leu Ala Asp Pro Ala Asp Val Arg Leu Leu 
85 90 95 
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Leu Gin Asp Leu Arg Gly His Asp Gin Asp Lys Leu Thr Val Gin Tyr 
100 105 110 

Leu Asp Lys Phe Ala His Leu Asp Phe lie lie Gly Val Cys Ala Lys 
115 120 125 

Asp Tyr Val Tyr Lys Asp Met He Asp Phe Leu Asn Arg Phe Asn 
130 135 140 

<210> 3 

<211> 647 

<212> DNA 

<213> Catalpa sp. 


<400> 3 

ttatctttca 

taccctccat 

gatcctaccc 

cagacctccc 

tggaggacaa 

gttacacgat 

cattatcgga 

ccaggcttga 

ccactcttgt 

tagtttactc 

gaaaaggtat 


ggagagattt 
cttccgtgaa 
aaatacgact 
gtttacgatt 
gatgcattgt 
gtggataagc 
gttactgcaa 
gaggttcttg 
tgaatgtgaa 
atagatacat 
tgcatctcct 


ttgtttgaat 
ttgagacccc 
acggcaatcc 
tatccaagat 
cggatgttaa 
tgcatgtgca 
aagatatagt 
attttggagt 
taagccattt 
gtaagaagca 
cttctacgtc 


gctccccccg 
tgtccatatg 
cagcttcaac 
tcccctcgac 
ggatgtcgag 
gtatatcaag 
ttataatcag 
gcttttgctg 
ccgagagatt 
acccgataca 
aaaaaaaaaa 


ttgagctttt 
gctcaaactg 
ttggcccatt 
attccgctct 
acattgctcg 
gattatgctc 
attgtaactt 
tgagaatgca 
taatggctgg 
tagtttgaat 
aaaaata 


tgtggaaaat 
tccgatatgg 
atggtgaatc 
tcctaagcta 
atagtctcaa 
atgccgactt 
ttttcagaaa 
acagcttgtt 
taaagcttat 
cctttatctc 


60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
647 


<210> 4 

<211> 116 

<212> PRT 

<213> Catalpa sp. 

<400> 4 

He Glu Thr Pro Val His Met Ala Gin Thr Val Arg Tyr Gly He Leu 
15 10 15 

Pro Lys Tyr Asp Tyr Gly Asn Pro Ser Phe Asn Leu Ala His Tyr Gly 
20 25 30 

Glu Ser Arg Pro Pro Val Tyr Asp Leu Ser Lys He Pro Leu Asp He 
35 40 45 

Pro Leu Phe Leu Ser Tyr Gly Gly Gin Asp Ala Leu Ser Asp Val Lys 
50 55 60 

Asp Val Glu Thr Leu Leu Asp Ser Leu Lys Leu His Asp Val Asp Lys 
65 70 75 80 

Leu His Val Gin Tyr He Lys Asp Tyr Ala His Ala Asp Phe He He 
85 90 95 

Gly Val Thr Ala Lys Asp He Val Tyr Asn Gin He Val Thr Phe Phe 
100 105 HO 

Arg Asn Gin Ala 
115 


<210> 5 
<211> 705 
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<212> DNA 

<213> Catalpa sp. 

<220> 

<221> unsure 
<222> (526) 

<220> 

<221> unsure 
<222> (561) 

<220> 

<221> unsure 
<222> (585) 

<220> 

<221> unsure 
<222> (593) 

<220> 

<221> unsure 
<222> (664) 

<220> 

<221> unsure 
<222> (679) 

<400> 5 

gcacgagcca acagcttcct aaatttagct cttctaatcc ttctctcatt atcactactc 60 
ctacctcatc aatcattcgc ctccagccgc cgccgttttc ttccgcagaa tgatgtcgtt 120 
cttccgccgg acggcgtttg ctccaccgcc gtaactgtac acggttataa atgccaagaa 180 
tttgaagtaa cgactgatga tggctatata ttaagcgtgc agaggattct ggagggccgg 240 
gccggaggag gagggccgaa gcggccgccg gttctgctgc aacatggggt tcttgtggac 300 
gggatgacgt ggctggtgaa tggaccggaa caatctttgg cgatgatatt ggctgacaat 360 
gggttcgacg tctggatttc taacataaga ggaactaggt ttagtcgtcg tcatgtcagc 420 
cttgatccta ccgatcctga atattgggat tgggcatggg acgatcttgg tgacccacga 4 80 
cttaccatcc ctgatcgagt tagtggtcag acaaacgggt cagaanacac actacatagg 540 
gcaatccatg gggaacttta ntagctttgg gatcactttt agganggaaa cangttggca 600 
gggtaaatcg gctgtatgtt aagccaattg gctaacgagt catatgcaac tgctctcgag 660 
ttgnctagca gatccttgnt ggggaacaca cgatcttggc ctgcg 705 

<210> 6 

<211> 157 

<212> PRT 

<213> Catalpa sp. 

<400> 6 

Ala Arg Ala Asn Ser Phe Leu Asn Leu Ala Leu Leu lie Leu Leu Ser 
15 10 15 

Leu Ser Leu Leu Leu Pro His Gin Ser Phe Ala Ser Ser Arg Arg Arg 
20 25 30 

Phe Leu Pro Gin Asn Asp Val Val Leu Pro Pro Asp Gly Val Cys Ser 
35 40 45 

Thr Ala Val Thr Val His Gly Tyr Lys Cys Gin Glu Phe Glu Val Thr 
50 55 60 

Thr Asp Asp Gly Tyr He Leu Ser Val Gin Arg He Leu Glu Gly Arg 
65 70 75 80 
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Ala Gly Gly Gly Gly Pro Lys Arg 
85. 

Val Leu Val Asp Gly Met Thr Trp 
100 

Leu Ala Met lie Leu Ala Asp Asn 
115 120 

lie Arg Gly Thr Arg Phe Ser Arg 
130 135 

Asp Pro Glu Tyr Trp Asd Trp Ala 
145 150 

<210> 7 

<211> 859 

<212> DNA 

<213> Zea mays 


Pro Pro Val Leu Leu Gin His Gly 
90 95 

Leu Val Asn Gly Pro Glu Gin Ser 
105 no 

Gly Phe Asp Val Trp lie Ser Asn 
125 

Arg His Val Ser Leu Asp Pro Thr 
140 


Trp Asp Asp Leu Gly 
155 


<220> 

<221> unsure 

<222> (46) 

<220> 

<221> unsure 

<222> (231) 


<400> 7 

aaagcaaaca acggcggaca tggtgcgccc 
cctcctcgtg ttcctctgcc tcctagccgg 
gctacgccgc gtctccccgc gcgcgggggc 
gcaggttacc cgtgcaccga gcacaccgtt 
cagcatattc cacatggcag aaatggaatt 
cagcacggtc ttttccaggg tggagataca 
ggatatatcc ttgctgacaa tggttttgat 
tggagtaaag gccactctac tctctctgtt 
caagaccttg ctgaatacga cgttttggca 
tccaaaattt tgtatgtggg acattcacag 
atgcctgaaa cagtaaagat gataagctct 
gatcacgtca gtgctagttt tgttcttaga 
gttattatgg gcatccatca gttgaacttc 
tcgctgtgcg atgatgaaca tttggactgc 
actgttgttc aatcatctc 

<210> 8 

<211> 286 

<212> PRT 

<213> Zea mays 

<220> 

<221> UNSURE 
<222> (16) 


aggaaaagcg cttgcngcgc cccagctcct 60 
tggagcccgc gcatccccgc ccacagacgc 120 
cggtggcctc tgccagcagc tgctcctgcc 180 
caaacggatg atggctttct nttgtctctt 240 
gcagataata ctggacctcc agtttttctt 300 
tggttcataa actccaatga acaatcactt 360 
gtttgggtcg gaaatgttcg tggcacacgt 420 
catgataagc ttttctggga ttggagttgg 480 
atgttaagcr atgtatatac agttgcacag 540 
ggaactatca tgggtttggc tgcgtttaca 600 
gctgcgcttc tttgtcccat ttcttacctt 660 
gcagttgcca tgcatcttga tgagatgctt 720 
cggagcgata tgggtgtaca gatattagat 780 
aacgatctgt tatcttcaat aacagtcaaa 840 

859 


<400> 8 

Lys Ala Asn Asn Gly Gly His Gly 
1 5 

Ala Pro Ala Pro Pro Pro Arg Val 
20 


Ala Pro Arg Lys Ser Ala Cys Xaa 
10 15 

Pro Leu Pro Pro Ser Arg Trp Ser 
25 30 
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Pro Arg lie Pro Ala His Arg Arg Ala Thr Pro Arg Leu Pro Ala Arg 
35 40 45 

Gly Gly Arg Trp Pro Leu Pro Ala Ala Ala Pro Ala Ala Gly Tyr Pro 
50 55 60 

Cys Thr Glu His Thr Val Gin Thr Asp Asp Gly Phe Leu Leu Ser Leu 
65 70 75 80 

Gin His lie Pro His Gly Arg Asn Gly lie Ala Asp Asn Thr Gly Pro 
85 90 95 

Pro Val Phe Leu Gin His Gly Leu Phe Gin Gly Gly Asp Thr TrD Phe 
100 105 110 

lie Asn Ser Asn Glu Gin Ser Leu Gly Tyr lie Leu Ala Asp Asn Gly 
115 120 125 

Phe Asp Val Trp Val Gly Asn Val Arg Gly Thr Arg Trp Ser Lys Gly 
130 135 140 

His Ser Thr Leu Ser Val His Asp Lys Leu Phe Trp Asp Trp Ser Trp 
145 150 155 160 

Gin Asp Leu Ala Glu Tyr Asp Val Leu Ala Met Leu Ser Tyr Val Tyr 
165 170 175 

Thr Val Ala Gin Ser Lys He Leu Tyr Val Gly His Ser Gin Gly Thr 
180 185 190 

He Met Gly Leu Ala Ala Phe Thr Met Pro Glu Thr Val Lys Met He 
195 200 205 

Ser Ser Ala Ala Leu Leu Cys Pro He Ser Tyr Leu Asp His Val Ser 
210 215 220 

Ala Ser Phe Val Leu Arg Ala Val Ala Met His Leu Asp Glu Met Leu 
225 230 235 240 

Val He Met Gly He His Gin Leu Asn Phe Arg Ser Asp Met Glv Val 
245 250 * 255 

Gin He Leu Asp Ser Leu Cys Asp Asp Glu His Leu Asp Cys Asn Asp 
260 265 270 

Leu Leu Ser Ser He Thr Val Lys Thr Val Val Gin Ser Ser 
275 280 285 

<210> 9 
<211> 509 
<212> DNA 
<213> Zea mays 

<220> 

<221> unsure 
<222> (162) 

<220> 

<221> unsure 
<222> (277) 
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<220> 

<221> unsure 

<222> (284) 

<220> 

<221> unsure 

<222> (290) 

<220> 

<221> unsure 

<222> (295) 

<220> 

<221> unsure 

<222> (386) 

<220> 

<221> unsure 

<222> (406) 

<220> 

<221> unsure 

<222> (413) 

<220> 

<221> unsure 

<222> (443) 

<220> 

<221> unsure 

<222> (468) 

<220> 

<221> unsure 

<222> (484) 

<220> 

<221> unsure 

<222> (489) 


<400> 9 

cgatcgagat ggctcagaag gatctctatc taccgttcct ggctctttcc atcattgcct 60 

gctgcttgat gaacctgcaa agtgttctca gctcaagcag gatgcggaat actacaaacg 120 

atattagtga tgacaaatgc cccccacaac ctcatccctt angtatgtgc aggtcccgag 180 

tagcagctta cggctatcca tgtgaggaat accatgtgac aacggaggat ggctacatcc 240 

ttagcttaaa gaagatcccc tatggtctct ctggtgncac cganattacn agganaccgg 300 

tactactgtt ccatgggcta ctggtggatg gtttctgttg ggtactgaac acaccaaaac 360 

aatcactggg cttcacctgg ctgaangtgg tttgaaattt ggatcnccac tcncccgaaa 420 

aaaatccacc gagggacaca ccnctccccc aaaaaccggc tttgggangg aatggaacac 480 

tgcnaaaana actcccgcgt gctgaatcc 509 


<210> 10 

<211> 125 

<212> PRT 

<213> Zea mays 

<220> 

<221> UNSURE 

<222> (52) 
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<220> 

<221> UNSURE 
<222> (90) 


<220> 

<221> UNSURE 
<222> (92) 


<220> 

<221> UNSURE 
<222> (96) 


<400> 10 
Met Ala Gin Lys 
1 


Asp Leu Tyr Leu Pro Phe Leu Ala 
5 10 


Leu Ser lie lie 
15 


Ala Cys Cys Leu 
20 


Met Asn Leu Gin Ser Val Leu Ser 
25 


Ser Ser Arg Met 
30 


Arg Asn Thr Thr 
35 


Asn Asp lie Ser Asp Asp Lys Cys 
40 


Pro Pro Gin Pro 
45 


His Pro Leu Xaa 
50 


Met Cys Arg Ser Arg Val Ala Ala Tyr Gly Tyr Pro 
55 60 


Cys Glu Glu Tyr 
65 


His Val Thr Thr Glu Asp Gly Tyr 
70 75 


lie Leu Ser Leu 
80 


Lys Lys He Pro Tyr Gly Leu Ser Gly Xaa Thr Xaa 
85 90 


He Thr Arg Xaa 
95 


Pro Val Leu Leu 
100 


Phe His Gly Leu Leu Val Asp Gly 
105 


Phe Cys Trp Val 
110 


Leu Asn Thr Pro 
115 


Lys Gin Ser Leu Gly Phe Thr TrD Leu 
120 " 125 


<210> 

11 

<211> 

273 

<212> 

DNA 

<213> 

Zea ma; 

<220> 


<221> 

unsure 

<222> 

(8) 

<220> 


<221> 

unsure 

<222> 

(20) 

<220> 


<221> 

unsure 

<222> 

(229) 

<220> 


<221> 

unsure 

<222> 

(236) 

<220> 


<221> 

unsure 

<222> 

(241) 
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<220> 

<221> unsure 

<222> (249) 

<220> 

<221> unsure 

<222> (268) 


<400> 

11 



cttcctcntg 

cacgcttcgn 

tttcagctct 

tcaacgacct 

gccggccatg 

gtcgacttcg 

acgtcggaca 

ctccatgggg 

acgctggtgg 

tgagccagct 

gaaatccgcg 

gcgctgctca 

nccccaatng 

gaatcctggt 

tggccaangc 

<210> 

12 



<211> 

90 



<212> 

PRT 



<213> 

Zea 

mays 


<220> 




<221> 

UNSURE 


<222> 

(76) 




actggaactg gtcctgggat gacctggtag 60 
tcgtcaaaca gaccggccag aagcctcact 120 
cgctggcggc cttctcggag ggccgggtgg 180 
cgccggtggc ctacctcgnc cacatnaaca 24 0 
gtt 273 


<220> 

<221> UNSURE 

<222> (78) 

<220> 

<221> UNSURE 

<222> (80) 

<220> 

<221> UNSURE 

<222> (83) 

<220> 

<221> UNSURE 

<222> (89) 


<400> 12 

Ser Ser Cys Thr Leu Arg Phe Gin 
1 5 

Asp Leu Val Val Asn Asp Leu Pro 
20 

Gin Thr Gly Gin Lys Pro His Tyr 

35 40 

Val Ala Leu Ala Ala Phe Ser Glu 
50 55 

Ser Ala Ala Leu Leu Thr Pro Val 
65 70 

Pro Asn Xaa Asn Pro Gly Trp Pro 
85 


Leu Tyr Trp Asn Trp Ser Trp Asp 
10 15 

Ala Met Val Asp Phe Val Val Lys 
25 30 

Val Gly His Ser Met Gly Thr Leu 
45 

Gly Arg Val Val Ser Gin Leu Lys 
60 

Ala Tyr Leu Xaa His Xaa Asn Xaa 
75 80 

Xaa Arg 
90 
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<210> 13 

<211> 1483 

<212> DNA 

<213> Oryza sativa 


<400> 13 

gcacgagtac 

agcgctcccc 

cccagccgcc 

gctcctgcca 

tttatctctt 

agtttttctt 

gcaatcactt 

tggaacgcgt 

ttggagctgg 

agtcacacag 

ggctttgacg 

ttcttatctt 

tcagatgctt 

aatagtagat 

tacaggggaa 

tcatccatca 

cgcaaagtat 

cgcatttgac 

tgatgcattg 

agaacttctg 

agatgatgtt 

ttaggatgtc 

gagtatgtat 

acgctgtcag 

ttatagaaaa 


acagcgcggc 
ctgatcctcc 
gccgccctcc 
ctcggctacc 
cagcatatcc 
caacatggtc 
gggtatatcc 
tggagtaaag 
caagagttag 
tccaaaattc 
atgcccgaaa 
gatcatgtta 
gttactatgg 
tctttgtgcg 
aactgttgct 
tcgacaaaaa 
gactatgggt 
ctaagcagca 
gctgatgtaa 
tacattggtg 
tatgtggacc 
ttcatgtgta 
atattgcata 
cagcaattgt 
ccctcttcat 


gggcgttggc 
tcgtcgtctc 
gccgcgtcgg 
cctgcaccga 
cacatggcaa 
tttttcaggg 
ttgctgataa 
gtcattcaac 
ctgaatatga 
tatatgtggg 
tagtaaaaat 
gtgctagttt 
gaattcacca 
atggtgaaca 
tcaatacatc 
atctgcacca 
tattgggaaa 
taccagaatc 
ccgatgttca 
actatggcca 
taataagatt 
taataaaaac 
tgagcttgtt 
atcattgtat 
ttcctcttca 


gatggcgatg 
ttgctgcggt 
ctccggctcc 
gcacaacgtt 
aaataaagca 
aggagacaca 
cggttttgat 
cttttctgtt 
ccttttagca 
gcattcacag 
gattagctct 
tgttctcaga 
gctgaacttc 
cgtggattgc 
aaggattgat 
tctttttcag 
cctaaggcgc 
actgcccata 
gcgtactatc 
tattgatttt 
tcttagggaa 
atctgtacag 
ggatctatgg 
ccaacttatc 
aaaaaaaaaa 


gcgggccacg 
cgcatcgtct 
ggcggcctct 
gaaacaaaag 
gcagatagta 
tggttcataa 
gtttggattg 
catgataagc 
atgctaggct 
ggaactataa 
gcagcacttc 
gcagtcgcca 
cgtagcgaca 
aacaatttgc 
tattatttgg 
atgatcagga 
tacggtcatt 
tggatgggat 
agagagctgg 
gttatgagcg 
aatggatggc 
tattggtcct 
tgcatgctct 
gctccactac 
aaa 


cccccggagg 
ccggagcctc 
gcgaccagct 
atggattcct 
ctggccctcc 
actctgctga 
ggaatgtccg 
ttttctggga 
atgtgtatac 
tgggtttggc 
tttgtcctat 
tgcatcttga 
tgggggttca 
tatctgcgat 
agtatgaacc 
aaggcacttt 
tgcgtcctcc 
atggaggtct 
gatctacacc 
tgaaggcgaa 
ataatagcta 
ctcccgatgt 
caagtctaaa 
tgtatatcca 


<210> 14 

<211> 410 

<212> PRT 

<213> Oryza sativa 

<400> 14 

Met Ala Met Ala Gly His Ala Pro Gly Gly Ala Leu Pro Leu He Leu 
1 5 10 15 

Leu Val Val Ser Cys Cys Gly Arg He Val Ser Gly Ala Ser Pro Ala 
20 25 30 

Ala Ala Ala Leu Arg Arg Val Gly Ser Gly Ser Gly Gly Leu Cys Asp 
35 40 45 

Gin Leu Leu Leu Pro Leu Gly Tyr Pro Cys Thr Glu His Asn Val Glu 
50 55 60 

Thr Lys Asp Gly Phe Leu Leu Ser Leu Gin His He Pro His Gly Lys 
65 70 75 80 

Asn Lys Ala Ala Asp Ser Thr Gly Pro Pro Val Phe Leu Gin His Gly 
85 90 95 

Leu Phe Gin Gly Gly Asp Thr Trp Phe He Asn Ser Ala Glu Gin Ser 
100 105 no 

Leu Gly Tyr He Leu Ala Asp Asn Gly Phe Asp Val Trp He Gly Asn 
115 120 125 


60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1483 
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Val Arg Gly Thr Arg Trp Ser Lys Gly His Ser Thr Phe Ser Val His 
130 135 140 

Asp Lys Leu Phe Trp Asp Trp Ser Trp Gin Glu Leu Ala Glu Tyr Asp 
145 150 155 160 

Leu Leu Ala Met Leu Gly Tyr Val Tyr Thr Val Thr Gin Ser Lys lie 
165 170 175 

Leu Tyr Val Gly His Ser Gin Gly Thr He Met Gly Leu Ala Ala Leu 
180 185 190 

Thr Met Pro Glu He Val Lys Met He Ser Ser Ala Ala Leu Leu Cys 
195 200 205 

Pro He Ser Tyr Leu Asp His Val Ser Ala Ser Phe Val Leu Arg Ala 
210 215 220 

Val Ala Met His Leu Asp Gin Met Leu Val Thr Met Gly He His Gin 
225 230 235 240 

Leu Asn Phe Arg Ser Asp Met Gly Val Gin He Val Asp Ser Leu Cys 
245 250 255 

Asp Gly Glu His Val Asp Cys Asn Asn Leu Leu Ser Ala He Thr Gly 
260 265 270 

Glu Asn Cys Cys Phe Asn Thr Ser Arg He Asp Tyr Tyr Leu Glu Tyr 
275 280 285 

Glu Pro His Pro Ser Ser Thr Lys Asn Leu His His Leu Phe Gin Met 
290 295 300 

He Arg Lys Gly Thr Phe Ala Lys Tyr Asp Tyr Gly Leu Leu Gly Asn 
305 310 315 320 

Leu Arg Arg Tyr Gly His Leu Arg Pro Pro Ala Phe Asp Leu Ser Ser 
325 330 335 

lie Pro Glu Ser Leu Pro He Trp Met Gly Tyr Gly Gly Leu Asp Ala 
340 345 350 

Leu Ala Asp Val Thr Asp Val Gin Arg Thr He Arg Glu Leu Gly Ser 
355 360 365 

Thr Pro Glu Leu Leu Tyr lie Gly Asp Tyr Gly His lie Asp Phe Val 
370 375 380 

Met Ser Val Lys Ala Lys Asp Asp Val Tyr Val Asp Leu He Arg Phe 
385 390 395 400 

Leu Arg Glu Asn Gly Trp His Asn Ser Tyr 


405 


410 


<210> 
<211> 
<212> 
<213> 


Oryza sativa 


15 

395 

DNA 


WO 99/55883 


PCT/US99/09280 


<220> 

<221> unsure 
<222> (12) 

<220> 

<221> unsure 
<222> (24) 

<220> 

<221> unsure 
<222> (29) 

<220> 

<221> unsure 
<222> (33) 

<220> 

<221> unsure 
<222> (43) 

<220> 

<221> unsure 

<222> (78) 

<220> 

<221> unsure 

<222> (182) 

<220> 

<221> unsure 

<222> (265) 

<220> 

<221> unsure 

<222> (300) 

<220> 

<221> unsure 

<222> (302) 

<220> 

<221> unsure 

<222> (306) 

<220> 

<221> unsure 

<222> (347) 

<220> 

<221> unsure 

<222> (351) 

<220> 

<221> unsure 

<222> (367) 

<220> 

<221> unsure 

<222> (370) 
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<220> 

<221> unsure 
<222> (380) 


<220> 

<221> unsure 
<222> (386) 


<400> 15 

acatctttca cnggcaaaaa 
tacgagcccc agccaacntc 
ggggttctga ccaagtacga 
gncgacccgc cggcgtacga 
tacggcggcc gggactcgct 
cngccncggc ggccacgtcg 
cccacgnccn acttcgtcan 

<210> 16 

<211> 80 

<212> PRT 

<213> Oryza sativa 


ctantgtcng aanaattcag 
cacaaaaacc ttgatccatc 
ctacgtgatg ccggacgcga 
catggcggcg atcccggcgt 
gtccnacccc gccgatcgtc 
gcgaccggct catccgtgcc 
tcgggntttc tgcgc 


canccgacat cttcctcaag 60 
tcgctcaaac ggtgagagac 120 
acgtggccag gtacgggcag 180 
ggttccccat cttcctcagc 240 
gccctcctcc tcgacgatcn 300 
agtaacnttc nccatactcg 360 

395 


<220> 

<221> UNSURE 
<222> (8) 


<220> 

<221> UNSURE 
<222> (10).. (11) 


<220> 

<221> UNSURE 

<222> (15) 

<220> 

<221> UNSURE 

<222> (61) 


<400> 16 

Thr Ser Phe Thr Gly Lys Asn Xaa 
1 5 

lie Phe Leu Lys Tyr Glu Pro Gin 
20 

His Leu Ala Gin Thr Val Arg Asp 

35 40 

Val Met Pro Asp Ala Asn Val Ala 
50 55 

Ala Tyr Asp Met Ala Ala He Pro 
65 70 


Cys Xaa Xaa Asn Ser Ala Xaa Asp 
10 15 

Pro Thr Ser Thr Lys Thr Leu He 
25 30 

Gly Val Leu Thr Lys Tyr Asp Tyr 
45 

Arg Tyr Gly Gin Xaa Asp Pro Pro 
60 

Ala Trp Phe Pro He Phe Leu Ser 
75 80 


<210> 17 

<211> 1718 

<212> DNA 

<213> Glycine max 
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<400> 17 

ggaatcaaat 

actttttctt 

aatacattaa 

aacatggctc 

actgtgcctc 

tcagtgtatg 

gaggttacaa 

aaaagcagtg 

gatggtatga 

aatggatttg 

tcattggacc 

gatttccctg 

ggccattcat 

cagctgaaat 

cttggtgttg 

gaatttaatc 

gggatagact 

tcaactgttg 

cacttggctc 

tataacatta 

cacgatctcc 

gttgagaatt 

atccaggaat 

aatgctgttc 

tatgaaaaga 

gtggagatta 

ttggtgttgc 

tggcttttgg 

ttataaaaac 


attcaactcg 
tacctttatt 
cacttcaatc 
ttctaggctt 
gtcaagcaca 
gcatatgtgc 
ctgatgatgg 
ggagtgggac 
catggcttct 
acgtgtggat 
cctctagcca 
cggtgtttaa 
tgggaacttt 
cagcagcctt 
ttgcacccaa 
caaaagggtt 
gctatgactt 
atctattctt 
agactgttag 
tgcactatgg 
ctctcttcat 
tgcttgataa 
atgctcatgc 
tttcattttt 
gtgccttcat 
agtaacggct 
aaatggctat 
cttcagttat 
tatgtttcca 


ttttcccatc 
gttccaatct 
ccacgctttc 
aatgagtttt 
cgcttcaagc 
ctcttctgtc 
ttacattctg 
aaggaagcaa 
aaacccacca 
tgcaaacaca 
ggcctattgg 
ttatgtgttc 
ggtagctttg 
gttgagccct 
gtcctttgtt 
agctgttgat 
gttgactgca 
gatgaatgag 
acttggggcg 
agaaatattt 
tagctatggt 
actcaagttc 
tgactacatt 
caatcatcaa 
gtattaggta 
aattacaaaa 
tgcatctatc 
ctaagatgaa 
aaaaaaaaaa 


cttttgtgtc 
tatcctatcc 
aatagataga 
gctgccttga 
cgtggcaact 
attgtgcatg 
agcctgcaaa 
ccagtggtta 
gagcaagatc 
agaggaacca 
aattggtctt 
agccaaacgg 
gcatccttct 
atagcctatt 
ggtgagatca 
gcctttctca 
ctaactggta 
cctcagtcaa 
ttgacaaaat 
cctccaatct 
ggaagagatg 
catgatgaga 
atggggttca 
gtttaacact 
gctatcattg 
gtaatgaagt 
tattgtgttg 
aaacgtggar 
aaaaaaaa 


tctctctttc 
tttaaatata 
tagagcattc 
cccttttctt 
taggcagaaa 
gatacaagtg 
ggatcccaga 
tacaacatgg 
tgccgttgat 
gatatagtcg 
gggatgaact 
ggcagaagat 
cggaaggaaa 
taagccacat 
ctaccctctt 
agtctctctg 
aaaattgctg 
catcaacaaa 
tcaattatgt 
ataacctttc 
cactttcaga 
acaagcgcag 
atgccaagga 
ggatagaatg 
agatcaatct 
attatcacta 
cattgtaatg 
gagatcattt 


cgtttcatac 
cacacacaaa 
attcatcacc 
ggtcctaaca 
catcaaccct 
tcaagaacac 
aggtcgaggt 
agttcttgta 
tttagctgat 
ccgacacatc 
tgtctcctat 
caattacgtt 
attggttacc 
gaatacagca 
cggtctagca 
tgctcaccct 
cctcaattct 
gaacatggtg 
gagaccagac 
caacatcccc 
tgtccgtgat 
cgttcagttc 
cttggtgtat 
aatcaagttg 
aagrtatcta 
gtgatttgct 
cagaggaaag 
atcaaaagaa 


60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1718 


<210> 18 

<211> 410 

<212> PRT 

<213> Glycine max 

<400> 18 

Met Ala Leu Leu Gly Leu Met Ser Phe Ala Ala Leu Thr Leu Phe Leu 
15 10 15 

Val Leu Thr Thr Val Pro Arg Gin Ala His Ala Ser Ser Arg Gly Asn 
20 25 30 

Leu Gly Arg Asn lie Asn Pro Ser Val Tyr Gly He Cys Ala Ser Ser 
35 40 45 

Val He Val His Gly Tyr Lys Cys Gin Glu His Glu Val Thr Thr Asp 
50 55 60 

Asp Gly Tyr He Leu Ser Leu Gin Arg He Pro Glu Gly Arg Gly Lys 
65 70 75 80 

Ser Ser Gly Ser Gly Thr Arg Lys Gin Pro Val Val He Gin His Gly 
85 90 95 

Val Leu Val Asp Gly Met Thr Trp Leu Leu Asn Pro Pro Glu Gin Asp 
100 105 110 


Leu Pro Leu He Leu Ala Asp Asn Gly Phe Asp Val Trp He Ala Asn 
115 120 125 
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Thr Arg Gly Thr Arg Tyr Ser Arg Arg His lie Ser Leu Asp Pro Ser 
130 135 140 

Ser Gin Ala Tyr Trp Asn Trp Ser Trp Asp Glu Leu Val Ser Tyr Asp 
145 150 155 " 160 

Phe Pro Ala Val Phe Asn Tyr Val Phe Ser Gin Thr Gly Gin Lys lie 
165 170 175 

Asn Tyr Val Gly His Ser Leu Gly Thr Leu Val Ala Leu Ala Ser Phe 
180 185 190 

Ser Glu Gly Lys Leu Val Thr Gin Leu Lys. Ser Ala Ala Leu Leu Ser 
195 200 205 

Pro lie Ala Tyr Leu Ser His Met Asn Thr Ala Leu Gly Val Val Ala 
210 215 220 

Pro Lys Ser Phe Val Gly Glu lie Thr Thr Leu Phe Gly Leu Ala Glu 
225 230 235 240 

Phe Asn Pro Lys Gly Leu Ala Val Asp Ala Phe Leu Lys Ser Leu Cys 


Ala His Pro Gly lie Asp Cys Tyr Asp Leu Leu Thr Ala Leu Thr Gly 
260 265 270 

Lys Asn Cys Cys Leu Asn Ser Ser Thr Val Asp Leu Phe Leu Met Asn 
275 280 285 

Glu Pro Gin Ser Thr Ser Thr Lys Asn Met Val His Leu Ala Gin Thr 
290 295 300 

Val Arg Leu Gly Ala Leu Thr Lys Phe Asn Tyr Val Arg Pro Asp Tyr 
305 310 315 320 

Asn He Met His Tyr Gly Glu He Phe Pro Pro He Tyr Asn Leu Ser 
325 330 335 

Asn He Pro His Asp Leu Pro Leu Phe He Ser Tyr Gly Gly Arg Asp 
340 345 350 

Ala Leu Ser Asp Val Arg Asp Val Glu Asn Leu Leu Asp Lys Leu Lys 
355 360 365 

Phe His Asp Glu Asn Lys Arg Ser Val Gin Phe He Gin Glu Tyr Ala 
370 375 380 

His Ala Asp Tyr He Met Gly Phe Asn Ala Lys Asp Leu Val Tyr Asn 
385 390 395 400 

Ala Val Leu Ser Phe Phe Asn His Gin Val 


245 


250 


255 


405 


410 


<210> 
<211> 
<212> 
<213> 


Glycine max 


19 

1438 
DNA 
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<400> 19 

gcaattcaga 

ataactgtct 

caccaaaaac 

tccgagcata 

tcttcttctc 

ggattattca 

atacttgcag 

catggacata 

ttagccctgt 

atatttgtag 

gagatagtag 

gtcagtgcac 

atgggcattc 

tgtgataccc 

ttcaatgagt 

aacttgaacc 

aagctaaaaa 

atacccaaat 

actgatttcc 

aactatggtc 

atgattagtt 

ggtatgatgg 

aaagttgaaa 

acagatatat 


ataacaataa 
ccatactctt 
agcaacacag 
cgattcaaac 
ttcggcttcg 
tggcaggtga 
atcatggttt 
tatctttatt 
atgatgttgc 
ttgggcattc 
aaaaggttga 
ctcttgtact 
atcaactaaa 
gcctaagttg 
cacgtgtgga 
accttttcca 
atctgataga 
cattgcctct 
agcacacact 
atgttgactt 
ttttcaagtc 
atgtaattac 
tattaatatt 
aaagatcggc 


agggtggatg 
gggaaatgga 
tttgtgtgaa 
gaaggatggt 
gaaccatgga 
tgcatggttt 
tgatgtttgg 
agagaagaaa 
ggaaatgatc 
acaggggaca 
ggctgcagct 
tagaatggtt 
cttcaaaagc 
caatgacatg 
gttttatctt 
gatgatccgc 
gtacggcaag 
gtggatggct 
caaggaattg 
cattttaagc 
atccggaaaa 
tgtaarggtc 
ctgtggagtc 
gtcgcatgac 


aggatccaga 
aaccccgttc 
gagctcatta 
ttcttgttag 
gatggaggcc 
ctaaatactc 
gtaggaaacg 
aagcaatttt 
aattacatta 
attatatctt 
cttctatctc 
aagatgcaca 
gaatgggggg 
ctttcatcca 
gaacaagaac 
aaaggtacct 
ttcaatccac 
tacggtggaa 
ccatccccac 
ttgcaagcaa 
tttagtagta 
tacgggtcca 
caccttgatt 
ctgtttctgc 


ggttcttggc 
agtgcttcga 
tcccctacgg 
gtcttcaacg 
ctccggttct 
cggaacaatc 
tgcgtggaac 
gggattggag 
attcagtaac 
tggctgcctt 
caatatcata 
ttgatgagat 
ccagtctctt 
taacagggaa 
ctcatccatc 
actccaagta 
caaagttcga 
atgatgctct 
cggaagtggt 
aacaagatct 
tgtaatgttt 
tctattactg 
ttctgtatgt 
aaaaaaaaaa 


cacactggcc 
cggcggtagc 
ttacccctgc 
tgtctcttct 
gcttctgcat 
acttggcttc 
acgctggagc 
ttggcaggaa 
aaactcaaag 
cactcaacca 
cttggatcat 
gattcttacc 
ggtttcctta 
gaattgttgc 
atcgtctaaa 
tgattatgga 
tcttagtcgc 
ggcagatata 
ttatcttgaa 
ttatgaccct 
gcttccttcc 
cacttactgt 
atatatgatg 
aaaaaaaa 


60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1438 


<210> 20 

<211> 405 

<212> PRT 

<213> Glycine max 

<400> 20 

Met Arg lie Gin Arg Phe Leu Ala Thr Leu Ala lie Thr Val Ser lie 
15 10 15 


Leu Leu Gly Asn 
20 


Gly Asn Pro Val Gin Cys Phe Asp Gly Gly Ser His 
25 30 


Gin Lys Gin Gin 

35 


His Ser Leu Cys Glu Giu Leu lie lie Pro Tyr Gly 
40 45 


Tyr Pro Cys Ser 
50 


Glu His Thr lie Gin Thr Lys Asp Gly Phe Leu Leu 
55 60 


Gly Leu Gin Arg 
65 


Val Ser Ser Ser Ser Ser Leu Arg Leu Arg Asn His 
70 75 80 


Gly Asp Gly Gly Pro Pro Val Leu Leu Leu His Gly Leu Phe Met Ala 
85 90 95 


Gly Asp Ala Trp 
100 


Phe Leu Asn Thr Pro Glu Gin Ser Leu Gly Phe lie 
105 110 


Leu Ala Asp His 
115 


Gly Phe Asp Val Trp Val Gly Asn Val Arg Gly Thr 
120 125 


Arg Trp Ser His 
130 


Gly His lie Ser Leu Leu Glu Lys Lys Lys Gin Phe 
135 140 


Trp Asp Trp Ser 
145 


Trp Gin Glu Leu Ala Leu Tyr Asp Val Ala Glu Met 
150 155 160 
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lie Asn Tyr lie Asn Ser Val Thr Asn Ser Lys He Phe Val Val Gly 
165 170 175 

His Ser Gin Gly Thr He He Ser Leu Ala Ala Phe Thr Gin Pro Glu 
180 185 190 

He Val Glu Lys Val Glu Ala Ala Ala Leu Leu Ser Pro He Ser Tyr 
195 200 205 

Leu Asp His Val Ser Ala Pro Leu Val Leu Arg Met Val Lys Met His 
210 215 220 

lie Asp Glu Met He Leu Thr Met Gly lie His Gin Leu Asn Phe Lys 
225 230 235 240 

Ser Glu Trp Gly Ala Ser Leu Leu Val Ser Leu Cys Asp Thr Arg Leu 
245 250 255 

Ser Cys Asn Asp Met Leu Ser Ser lie Thr Gly Lys Asn Cys Cys Phe 
260 265 270 

Asn Glu Ser Arg Val Glu Phe Tyr Leu Glu Gin Glu Pro His Pro Ser 
275 \ 280 285 

Ser Ser Lys Asn Leu Asn His Leu Phe Gin Met He Arg Lys Gly Thr 
290 295 300 

Tyr Ser Lys Tyr Asp Tyr Gly Lys Leu Lys Asn Leu lie Glu Tyr Gly 
305 310 315 320 

Lys Phe Asn Pro Pro Lys Phe Asp Leu Ser Arg lie Pro Lys Ser Leu 
325 330 335 

Pro Leu Trp Met Ala Tyr Gly Gly Asn Asp Ala Leu Ala Asd lie Thr 
340 345 350 

Asp Phe Gin His Thr Leu Lys Glu Leu Pro Ser Pro Pro Glu Val Val 
355 360 365 

Tyr Leu Glu Asn Tyr Gly His Val Asp Phe lie Leu Ser Leu Gin Ala 
370 375 380 

Lys Gin Asp Leu Tyr Asp Pro Met lie Ser Phe Phe Lys Ser Ser Gly 
385 390 395 400 

Lys Phe Ser Ser Met 
405 

<210> 21 
<211> 737 
<212> DNA 
<213> Zea mays 

<400> 21 

gcacgaggtt ttgtgccctt gatctatctg ttaaatttgg atcgcaggag gttgaactca 60 

tgacctttgg acagcctcgg ataggcaatc ctgcatttgc tgtatacttt ggtgaacaag 120 

tcccaagaac aatccgtgtg acccatcaga atgatattgt gccgcattta ccaccgtatt 180 

attattacct aggtgaatgg acataccacc acttcgctag agaggtttgg cttcatgaga 240 

gcatagatgg aaatgtagtt accagaaacg agacggtatg tgatgattct ggtgaagacc 300 

cgacctgtag caggtcggtc tatgggatga gcgtagcaga tcatcttgag tactatgatg 360 
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tcacactaca 
tatacaacta 
cacaagctct 
ttcattttca 
ataaggagag 
aagctccata 
taaaaaaaaa 


tgctgattca 
cgttcgtgaa 
agaatctatg 
ttttgtgtac 
gctcaccttt 
agattgtccg 
aaaaaaa 


agaggaacct 
gttgatggat 
tgactttgta 
agctcatgaa 
ttaaatgtgc 
cacaattcaa 


gtcaattcgt 
ccatcatcct 
tgccacggaa 
atgctgggcg 
cccctttgct 
tttgtgtata 


gattggtgca 
gtcaagatac 
tgcacccctg 
ctcctggagc 
caagtgagaa 
taaataatac 


gccaaccaag 
ccgcaagaac 
tacagtattt 
tctccagagg 
tcgtgcatgt 
tatgtgttac 


420 
480 
540 
600 
660 
720 
737 


<210> 22 

<211> 166 

<212> PRT 

<213> Zea mays 


<400> 22 

Thr Arg Phe Cys Ala 
1 5 

Val Glu Leu Met Thr 
20 

Ala Val Tyr Phe Gly 

35 

Gin Asn Asp He Val 
50 

Glu Trp Thr Tyr His 
65 

He Asp Gly Asn Val 
85 

Gly Glu Asp Pro Thr 
100 

Asp His Leu Glu Tyr 
115 

Thr Cys Gin Phe Val 
130 

Arg Glu Val Asp Gly 
145 


Leu Asp Leu Ser Val Lys 
10 

Phe Gly Gin Pro Arg He 
25 

Glu Gin Val Pro Arg Thr 
40 

Pro His Leu Pro Pro Tyr 
55 

His Phe Ala Arg Glu Val 
70 75 

Val Thr Arg Asn Glu Thr 
90 

Cys Ser Arg Ser Val Tyr 
105 

Tyr Asp Val Thr Leu His 
120 

He Gly Ala Ala Asn Gin 
135 

Ser He He Leu Ser Arg 
150 155 


Phe Gly Ser Gin Glu 
15 

Gly Asn Pro Ala Phe 
30 

He Arg Val Thr His 
45 

Tyr Tyr Tyr Leu Gly 
60 

Trp Leu His Glu Ser 
80 

Val Cys Asp Asp Ser 
95 

Gly Met Ser Val Ala 
110 

Ala Asp Ser Arg Gly 
125 

Val Tyr Asn Tyr Val 
140 

Tyr Pro Gin Glu Pro 
160 


Gin Ala Leu Glu Ser Met 
165 


<210> 23 

<211> 1434 

<212> DNA 

<213> Zea mays 

<220> 

<221> unsure 

<222> (226) 

<220> 

<221> unsure 

<222> (315) 
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<220> 

<221> unsure 

<222> (1306) 

<220> 

<221> unsure 

<222> (1349) 

<220> 

<221> unsure 

<222> (1359) 

<220> 

<221> unsure 

<222> (1368) 

<220> 

<221> unsure 

<222> (1373) 


<400> 23 

acccacgcgt ccgcccacgc gtccggctct ggaagcaggt tcagatttag cctgggtgcg 60 

tctgcaggtt ccggttcatg gagagatgga gcttgggtgc caaagtggta gctctcgcac 120 

tcttgctgtc tgctgcttct catggaagag agttgcctgt caagagtagt gaccgcagtt 180 

ttatctacaa ccatactctt gcaaagacgc ttgtggaata tgcatnagcg gtgtatatga 240 

cagatttaac cgctctgttt acgtggacat gctcaagatg caatgacttg actcaaggat 300 

tcgagatgag atccntaatt gttgatgtgg agaaactgct tgcaggcatt gttggtgtag 360 

atcatagtct gaattcgata attgttgcaa tcaggggaac tcaagagaac agtgtacaga 4 20 

attggataaa agacttgata tggaagcagc ttgatctaag tnatccaaac atgccaaatg 480 

caaaggtgca cagtggattt ttctcctcgt acaacaatac aattttgcgt ctagctatca 540 

caagtgctgt gcacaaggca agaaagtcat atggagatat caatgtcara gtgacaggcc 600 

actcgatggg aggagctatg gcttcttttt gcgcgctcga tcttgctatg aagcttggag 660 

gtggcagtgt gcaactcatg acttttgggc agcctcgtgt tggcaatgct gcattcgcct 720 

catacttcgc caaatatgta cccaacacaa ttcgagtgac acacgggcat gatattgtgc 780 

cacatttgcc accttatttc tcctttcttc cccagctgac ataccaccat ttcccaagag 840 

aggtatgggt ccaggattct gatggcaaca caactgaacg gatttgtgac gacagcggtg 900 

aagacccaga ttgttgcagg tgcatctcca tgttcggctt gaggattcag gaccattcac 960 

ttacctagga gttgatatgg aagcggacga ctggagcacc tgtagaatca tcacagctca 1020 

aagggttcag cagttccgac tggagctagc ggcaacatca tgatgaccaa gcacgatatc 1080 

gacgtctcca tcgttgaacc cagtgtacaa aacagattgg agcagttcta gataggcgga 1140 

acattcgttt tgtccagatt cagagaagca acagcctctt gttaatgcag tggaaattgt 1200 

tcagagacga aggcgaccct tgtttctcta ttagttcggt cagaatggga tgttctttca 1260 

gtcaaggaat aaatcgggag catctcttgt aacaaagaga tcaganatga tgtcataagg 1320 

aaaatcatag gacgtttatg ctgattggna ggattgctnt ggtaatanat gancatgtaa 1380 

cttcatgctt attcagaaca tagaccagct actgaaatta gttacgaaaa aaaa 1434 

<210> 24 
<211> 296 
<212> PRT 
<213> Zea mays 


<220> 

<221> UNSURE 

<222> (50) 

<220> 

<221> UNSURE 

<222> (80) 


18 


WO 99/55883 PC17US99/09280 


<220> 

<221> UNSURE 
<222> (129) 

<400> 24 

Met Glu Arg Trp Ser Leu Gly Ala Lys Val Val Ala Leu Ala Leu Leu 
15 10 15 

Leu Ser Ala Ala Ser His Gly Arg Glu Leu Pro Val Lys Ser Ser Asp 
20 25 30 

Arg Ser Phe lie Tyr Asn His Thr Leu Ala Lys Thr Leu Val Glu Tyr 
35 40 45 

Ala Xaa Ala Val Tyr Met Thr Asp Leu Thr Ala Leu Phe Thr Trp Thr 
50 55 60 

Cys Ser Arg Cys Asn Asp Leu Thr Gin Gly Phe Glu Met Arg Ser Xaa 
65 70 75 80 

He Val Asp Val Glu Lys Leu Leu Ala Gly He Val Gly Val Asp His 
85 90 95 

Ser Leu Asn Ser He He Val Ala He Arg Gly Thr Gin Glu Asn Ser 
100 105 HO 

Val Gin Asn Trp He Lys Asp Leu He TrD Lys Gin Leu Asp Leu Ser 
115 120 " 125 

Xaa Pro Asn Met Pro Asn Ala Lys Val His Ser Gly Phe Phe Ser Ser 
130 135 140 

Tyr Asn Asn Thr He Leu Arg Leu Ala He Thr Ser Ala Val His Lys 
145 150 155 160 

Ala Arg Lys Ser Tyr Gly Asp lie Asn Val He Val Thr Gly His Ser 
165 170 175 

Met Gly Gly Ala Met Ala Ser Phe Cys Ala Leu Asp Leu Ala Met Lys 
180 185 190 

Leu Gly Gly Gly Ser Val Gin Leu Met Thr Phe Gly Gin Pro Arg Val 
195 200 205 

Gly Asn Ala Ala Phe Ala Ser Tyr Phe Ala Lys Tyr Val Pro Asn Thr 
210 215 220 

He Arg Val Thr His Gly His Asp He Val Pro His Leu Pro Pro Tyr 
225 230 235 240 

Phe Ser Phe Leu Pro Gin Leu Thr Tyr His His Phe Pro Arg Glu Val 
245 250 255 

Trp Val Gin Asp Ser Asp Gly Asn Thr Thr Glu Arg He Cys Asp Asp 
260 265 270 

Ser Gly Glu Asp Pro Asp Cys Cys Arg Cys lie Ser Met Phe Gly Leu 
275 280 285 

Arg lie Gin Asp His Ser Leu Thr 
290 295 
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<210> 25 

<211> 1560 

<212> DNA 

<213> Zea mays 

<220> 

<221> unsure 
<222> (601) 

<400> 25 

tcccactgaa ccgggaggcg cccaacttcc ggtccatgat cgggatgatc gacgggagga 60 

cggagctgaa gccgctgcca gccagcggcg gcccagagga ccggcggctg caggtggtgg 120 

gcgtctccgc cgccgccgga tccggcgcga tgactactac ttggacgtgg agagcggcag 180 

gcggcggtgc cgctggtgca gcagcagtac gtgaacgggc ggctcgtccg cctccgcacc 240 

ttctccgtgt tcgaggtgag catgatggcc gccaagatcg cctacgagaa cgccgcctac 300 

atcgagaacg tcgtcaacaa cgtctggaag ttccacttcg tggggttcta caactgctgg 360 

aacaagttcg tgggcgacca cacgacgcag gcgttcgtgt tcaccgacaa ggcaagagga 420 

cgcgagcgtg gtggtggtgt cgttccgggg cacggagccc ttcaacatgc gggactggtc 480 

cacggacgta aacctgtcgt ggctgggcat gggcgagctg ggccacgtcc acgtcggctt 54 0 

cctcaaggcg ctgggcctgc aggaggagga cggcaaggac gccacgcggg cgttccccaa 600 

nggcgccccc aacgccgtcc cgggcaagcc gctggcctac tacgcgctgc gcgaggaggt 660 

ccagaagcag ctgcagaagc acccgaacgc caacgtcgtg gtcaccggcc acagcctcgg 720 

cgccgcgctg gcgaccatct tcccggcgct gctggcgttc cacggggagc ggggcgtcct 780 

ggaccgcctg ctctccgtgg tcacctacgg gcagccgcgc gtgggcgaca aggtgttcgc 84 0 

gggctacgtg cgcgccaacg tgcccgtgga gccgctccgg gtggtgtacc gctacgacgt 900 

ggtcccgcgc gtgcccttcg acgcgccgcc cgtcgccgac ttcgcgcacg gcggcacctg 960 

cgtctacttc gacggctggt acaagggccg cgagatcgcc aagggcggcg acgcgcccaa 1020 

caagaactac ttcgacccca ggtacctgct gtccatgtac ggcaacgcgt ggggggacct 108G 

cttcaagggc gccttcctgt gggccaagga gggcaaggac taccgcgagg gcgccgtctc 114 0 

gctgctctac cgcgccaccg gcctgctcgt gcccggcatc gcgtcgcaca gccccaggga 1200 

ctacgtcaac gccgtccgcc tcggcagcgt cgcctcggcg tagcttttgg attgcatgtt 1260 

cgtttccatg catgtgtatc attgcatgca ataattggat gaaataaaca gcaataagct 1320 

tcatcagtat tattattgtt gttgttgaat atatgcatcc tctcctctct atatagaatt 1380 

atagatacat gaggcctggc cggccgcgca cgttgctgaa cagttgaagc gcttcccaaa 14 40 

aaaaaatgta tcaactgtga agcatatata tccatgcatg catgtgtgcc cgaaattttt 1500 

gtttttaaaa aaaaaaaaaa aaaaaaaaac aaaaaaaaaa aaaaaacaaa aaaaaaaaaa 1560 

<210> 26 

<211> 258 

<212> PRT 

<213> Zea mays 

<220> 

<221> UNSURE - 
<222> (45) 

<400> 26 

Met Arg Asp Trp Ser Thr Asp Val Asn Leu Ser Trp Leu Gly Met Gly 
IS 10 15 

Glu Leu Gly His Val His Val Gly Phe Leu Lys Ala Leu Gly Leu Gin 
20 25 30 

Glu Glu Asp Gly Lys Asp Ala Thr Arg Ala Phe Pro Xaa Gly Ala Pro 
35 40 45 

Asn Ala Val Pro Gly Lys Pro Leu Ala Tyr Tyr Ala Leu Arg Glu Glu 
50 55 60 
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Val Gin Lys Gin Leu Gin Lys His Pro Asn Ala Asn Val Val Val Thr 
65 " 70 75 80 

Gly His Ser Leu Gly Ala Ala Leu Ala Thr lie Phe Pro Ala Leu Leu 
85 90 95 

Ala Phe His Gly Glu Arg Gly Val Leu Asp Arg Leu Leu Ser Val Val 
100 105 110 

Thr Tyr Gly Gin Pro Arg Val Gly Asp Lys Val Phe Ala Gly Tyr Val 
115 120 t 125 

Arg Ala Asn Val Pro Val Glu Pro Leu Arg Val Val Tyr Arg Tyr Asp 
130 135 140 

Val Val Pro Arg Val Pro Phe Asp Ala Pro Pro Val Ala Asp Phe Ala 
145 150 155 160 

His Gly Gly Thr Cys Val Tyr Phe Asp Gly Trp Tyr Lys Gly Arg Glu 
165 170 175 

lie Ala Lys Gly Gly Asp Ala Pro Asn Lys Asn Tyr Phe Asp Pro Arg 
180 185 190 

Tyr Leu Leu Ser Met Tyr Gly Asn Ala Trp Gly Asp Leu Phe Lys Gly 
195 200 205 

Ala Phe Leu Trp Ala Lys Glu Gly Lys Asp Tyr Arg Glu Gly Ala Val 
210 215 220 

Ser Leu Leu Tyr Arg Ala Thr Gly Leu Leu Val Pro Gly lie Ala Ser 
225 230 235 240 

His Ser Pro Arg Asp Tyr Val Asn Ala Val Arg Leu Gly Ser Val Ala 
245 250 255 


Ser Ala 


<210> 27 

<211> 432 

<212> DNA 

<213> Oryza sativa 

<220> 

<221> unsure 

<222> (7) 

<220> 

<221> unsure 

<222> (15) 

<220> 

<221> unsure 

<222> (27) 

<220> 

<221> unsure 

<222> (38) 
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<220> 

<221> unsure 

<222> (50) 

<220> 

<221> unsure 

<222> (94) 

<220> 

<221> unsure 

<222> (99) 

<220> 

<221> unsure 

<222> (103) 

<220> 

<221> unsure 

<222> (105) 

<220> 

<221> unsure 

<222> (117) 


<400> 27 

catagtnata 

aggaagacat 

tttagcatcg 

ctcatgactt 

caagtcccga 

tatttttgct 

gagaccatag 

gacccaacat 


atacnaacag 
atggaaggct 
ttcttgtgcc 
ttggacagcc 
gaacaatccg 
accttggcga 
taggaaatgt 
gc 


ttgcggncat 
acctataaat 
cttgacctct 
tcgggtaggc 
tgtgacccat 
atggacatat 
agttactagg 


tgagattntt 
gttntaggnt 
cttgttaagt 
aatccttctt 
cagaatgaca 
caccacttct 
aatgagacca 


ggaaatctgn 
cantncgatg 
atggatcgca 
ttgctgcgta 
ttgtcccaca 
cgagagaggt 
tctgtgatgg 


tcggtgggca 60 
ggagggncct 120 
ggaagttcaa 180 
cttcagtgac 240 
cttgccacca 300 
ttggcttcat 360 
atcaggcgag 420 
432 


<210> 28 

<211> 106 

<212> PRT 

<213> Oryza sativa 

<400> 28 

Gly Pro Phe Ser He Val Leu Val Pro Leu Thr Ser Leu Val Lys Tyr 
1 5 10 15 

Gly Ser Gin Glu Val Gin Leu Met Thr Phe Gly Gin Pro Arg Val Gly 
20 25 30 

Asn Pro Ser Phe Ala Ala Tyr Phe Ser Asp Gin Val Pro Arg Thr He 
35 40 45 

Arg Val Thr His Gin Asn Asp He Val Pro His Leu Pro Pro Tyr Phe 
50 55 60 

Cys Tyr Leu Gly Glu Trp Thr Tyr His His Phe Ser Arg Glu Val Trp 
65 7 0 75 80 

Leu His Glu Thr He Val Gly Asn Val Val Thr Arg Asn Glu Thr He 
85 90 95 

Cys Asp Gly Ser Gly Glu Asp Pro Thr Cys 
100 105 
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<210> 29 

<211> 1234 

<212> DNA 

<213> Glycine max 


<400> 29 

ccactggaag 

ccacaaccca 

ctttccgagg 

ggtacgcatt 

gaaaaatgtg 

ctatgttatt 

cattacgggt 

gcatgatgag 

agttggagat 

gtattgcagg 

cttgctcttc 

tctcgaagaa 

caatgctgtt 

tagagaagga 

tgctcacggt 

taaagcagat 

gtcatgcaga 

acttgattat 

agtttgtcgt 

ctttggtgca 

aaaaaaaaaa 


atggaattcg 
agtcttgatt 
aacggaaccc 
cccggcattg 
gggtggccta 
agggacattc 
catagtttgg 
aagttgctga 
gaagcatatg 
tttgtttatt 
aagcactttg 
gagccgaata 
ttggaactaa 
tggtttctct 
ccacaagatt 
tgatgtgtcc 
ctaaaattta 
gtatcatgtg 
cagtacgagt 
tggatttatc 
aaaaaaaaaa 


tgagattttt 
gttttggaca 
tttgatgcag 
gaaaaatgca 
aggagattca 
taaggaaagg 
gtggagcact 
ttgagaggtt 
cacagtatat 
gcaatgacat 
ggatctgcct 
agaactattt 
taaggagctt 
ttagtttcag 
atattaattc 
gtatacatga 
cataatcaag 
aagaatagtt 
tattgtatag 
attaatgaga 
aaaaaaaaaa 


tgattgctgg 

agcatgagaa 

atgcatggtg 

tggtggcttc 

aagggatgaa 

tttgagtgaa 

cgcaatcttg 

ggaagggatc 

gagacaaaaa 

agttccgagg 

tttctttaac 

ctcgccatgg 

taccatagcg 

gttggttggt 

cactcttctg 

tcattccata 

atttttagtt 

atgtatcata 

taattaataa . 

tcaatgtgaa 

aaaa 


gaatgatttt 
ccgcgatact 
cactgacctt 
atgaaagcct 
aatcttcccc 
aatcctaatg 
taccctacga 
tacacgtttg 
ttgagggaaa 
ttgccctatg 
aggcgctatg 
tgtgtgatac 
tacaaaaatg 
ctgctgattc 
ggatcaattg 
ccactacgta 
ttagaaaaaa 
atgatcatga 
gctaggttta 
gtttgtttat 


caagaaaagg 
tatgtggtag 
gacatctcat 
tagggctaca 
cgttggccta 
caaagtttat 
tcatgttctt 
ggcaaccaag 
attctatcag 
atgataagga 
aactcaggat 
ccatgatgtt 
gacctcacta 
ctggcttacc 
aaaaacattt 
catgtgtatg 
tggtaataac 
ataatataac 
aagttgtttc 
ttcaaaaaaa 


60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1234 


<210> 30 

<211> 246 

<212> PRT 

<213> Glycine max 

<400> 30 

His Leu Met Val Arg He Pro Gly He Gly Lys Met His Gly Gly Phe 
15 10 15 

Met Lys Ala Leu Gly Leu Gin Lys Asn Val Gly Trp Pro Lys Glu He 
20 25 30 

Gin Arg Asp Glu Asn Leu Pro Pro Leu Ala Tyr Tyr Val He Arg Asp 
35 40 45 

He Leu Arg Lys Gly Leu Ser Glu Asn Pro Asn Ala Lys Phe He He 
50 55 60 

Thr Gly His Ser Leu Gly Gly Ala Leu Ala He Leu Tyr Pro Thr He 
65 70 75 80 

Met Phe Leu His Asp Glu Lys Leu Leu He Glu Arg Leu Glu Gly He 
85 " 90 95 

Tyr Thr Phe Gly Gin Pro Arg Val Gly Asp Glu Ala Tyr Ala Gin Tyr 
100 105 110 

Met Arg Gin Lys Leu Arg Glu Asn Ser lie Arg Tyr Cys Arg Phe Val 
115 120 125 


Tyr Cys Asn Asp He Val Pro Arg Leu Pro Tyr Asp Asp Lys Asp Leu 
130 135 140 
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Leu Phe Lys His Phe Gly lie Cys Leu Phe Phe Asn Arg Arg Tyr Glu 
145 150 155 160 

Leu Arg lie Leu Glu Glu Glu Pro Asn Lys Asn Tyr Phe Ser Pro Trp 
165 170 175 

Cys Val lie Pro Met Met Phe Asn Ala Val Leu Glu Leu lie Arg Ser 
180 185 190 

Phe Thr lie Ala Tyr Lys Asn Gly Pro His Tyr Arg Glu Gly Trp Phe 
195 200 205 

Leu Phe Ser Phe Arg Leu Val Gly Leu Leu He Pro Gly Leu Pro Ala 
210 215 220 

His Gly Pro Gin Asp Tyr He Asn Ser Thr Leu Leu Gly Ser He Glu 
225 230 235 240 

Lys His Phe Lys Ala Asp 
245 

<210> 31 

<211> 490 

<212> DNA 

<213> Glycine max 

<400> 31 

gcacgaggag agatggccta aagaaattga aaccgatgag aaccgtccac gtgtctatta 60 

ctccataagg gatttgctaa agaagtgttt gaatagaaat gataaagcaa agtttattct 120 

tacgggtcat agtcttggtg gagcacttgc aattcttttt cccgctatgc taattttgca . 180 

tgctgagaca tttcttttgg aaaggcttga aggggtgtac acatttggac agcctagggt 24 0 

tggagatgaa acatttgcta aatacatgga aaatcaattg aaacattatg gcattaagta 300 

ttttaggttt gtttactgca acgatattgt tcctaggttg ccctttgatg aagatatcat 360 

gaaatttgag cattttggga catgtcttta ttatgacagg agctatacat gcaaggtaca 420 

tatataagta tttaattttt ttgattcatg catatattcg tcattgtaat caactttttt 480 

ttttctgggg 4 90 

<210> 32 

<211> 141 

<212> PRT 

<213> Glycine max 

<400> 32 

His Glu Glu Arg Trp Pro Lys Glu He Glu Thr Asp Glu Asn Arg Pro 
15 10 15 

Arg Val Tyr Tyr Ser He Arg Asp Leu Leu Lys Lys Cys Leu Asn Arg 
20 25 30 

Asn Asp Lys Ala Lys Phe He Leu Thr Gly His Ser Leu Gly Gly Ala 
35 40 45 

Leu Ala He Leu Phe Pro Ala Met Leu He Leu His Ala Glu Thr Phe 
50 55 60 

Leu Leu Glu Arg Leu Glu Gly Val Tyr Thr Phe Gly Gin Pro Arg Val 
65 70 75 80 

Gly Asp Glu Thr Phe Ala Lys Tyr Met Glu Asn Gin Leu Lys His Tyr 
85 90 95 
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Gly lie Lys Tyr Phe Arg Phe Val Tyr Cys Asn Asp lie Val Pro Arg 
100 105 HO 

Leu Pro Phe Asp Glu Asp lie Met Lys Phe Glu His Phe Gly Thr Cys 
115 120 125 

Leu Tyr Tyr Asp Arg Ser Tyr Thr Cys Lys Val His lie 
130 135 140 

<210> 33 
<211> 774 
<212> DNA 

<213> Triticum aestivum 
<400> 33 

gcacgagaat attcccatca tggtgacagg acattccatg ggaggggcca tggcttcgtt 60 

ttgtgccctt gatcttattg tcaactatgg gttaaaggac gtgaccctgc tgacatttgg 120 

gcaacctcgg attggtaatg ctgtgtttgc tacccacttt aagaaatact tgccaaacgc 180 

aattcgagtt accaacgcac atgatattgt gcctcatcta cccccgtact accagtactt 24 0 

cccacagaat acctaccatc atttcccacc agaggtttgg gttcataaca ttggactcga 300 

tagcctacta tacccgatcg agcacatctg tgatcattct ggagaaagac cccacttgca 360 

gcaggccctt ggttggaaat agcgtccagg cccatacccc ctttcttggc tccagcatcc 420 

atcccgagtc gcgcggatca tccagaatcg tcacggatga caatatgctc aggcacaaag 480 

ttgcccctgt agacggtgtt attgtcttct cgaagcagcc tggtttatca gttggtcagc 54 0 

tactcagtac acagtaaaca agctcaagat tacatggatt tattttgatg tttttttttg 600 

ccaaagaaca atattcttgt tggcaatcaa agcactatct catgtatata tacgcgtgtg 660 

atcctggctg gattaaatta tcctagctga gggtgtattt ctgaaatgta caaacatatc 720 

tatgctgatt aaaaaaaaaa aaaaaaatac ttgaggcggc cccgtaccaa aaat 774 

<210> 34 

<211> 126 

<212> PRT 

<213> Triticum aestivum 

<400> 34 

His Glu Asn He Pro He Met Val Thr Gly His Ser Met Gly Gly Ala 
1 5 10 15 

Met Ala Ser Phe Cys Ala Leu Asp Leu He Val Asn Tyr Gly Leu Lys 
20 25 30 

Asp Val Thr Leu Leu Thr Phe Gly Gin Pro Arg He Gly Asn Ala Val 
35 40 45 

Phe Ala Thr His Phe Lys Lys Tyr Leu Pro Asn Ala He Arg Val Thr 
50 55 60 

Asn Ala His Asp He Val Pro His Leu Pro Pro Tyr Tyr Gin Tyr Phe 
65 ™ 75 80 

Pro Gin Asn Thr Tyr His His Phe Pro Pro Glu Val Trp Val His Asn 
85 90 95 

He Gly Leu Asp Ser Leu Leu Tyr Pro He Glu His He Cys Asp His 
100 105 no 

Ser Gly Glu Arg Pro His Leu Gin Gin Ala Leu Gly Trp Lys 
115 120 125 

<210> 35 
<211> 398 
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<212> PRT 

<213> Canis familiaris 
<400> 35 

Met Trp Leu Leu Leu Thr Ala Ala Ser Val lie Ser Thr Leu Gly Thr 
15 10 15 

Thr His Gly Leu Phe Gly Lys Leu His Pro Thr Asn Pro Glu Val Thr 
20 25 30 

Met Asn lie Ser Gin Met lie Thr Tyr Trp Gly Tyr Pro Ala Glu Glu 
35 40 45 

Tyr Glu Val Val Thr Glu Asp Gly Tyr He Leu Gly lie Asp Arg He 
50 55 60 

Pro Tyr Gly Arg Lys Asn Ser Glu Asn He Gly Arg Arg Pro Val Ala 
65 70 75 80 

Phe Leu Gin His Gly Leu Leu Ala Ser Ala Thr Asn Trp He Ser Asn 
85 90 95 

Leu Pro Asn Asn Ser Leu Ala Phe He Leu Ala Asp Ala Gly Tyr Asp 
100 105 110 

Val Trp Leu Gly Asn Ser Arg Gly Asn Thr Trp Ala Arg Arg Asn Leu 
115 120 125 

Tyr Tyr Ser Pro Asp Ser Val Glu Phe Trp Ala Phe Ser Phe Asp Glu 
130 135 140 

Met Ala Lys Tyr Asp Leu Pro Ala Thr He Asp Phe He Leu Lys Lys 
145 150 155 160 

Thr Gly Gin Asp Lys Leu His Tyr Val Gly His Ser Gin Gly Thr Thr 
165 170 175 

He Gly Phe He Ala Phe Ser Thr Asn Pro Lys Leu Ala Lys Arg He 
180 185 190 

Lys Thr Phe Tyr Ala Leu Ala Pro Val Ala Thr Val Lys Tyr Thr Glu 
195 200 205 

Thr Leu Leu Asn Lys Leu Met Leu Val Pro Ser Phe Leu Phe Lys Leu 
210 215 220 

He Phe Gly Asn Lys He Phe Tyr Pro His His Phe Phe Asp Gin Phe 
225 230 235 240 

Leu Ala Thr Glu Val Cys Ser Arg Glu Thr Val Asp Leu Leu Cys Ser 
245 250 255 

Asn Ala Leu Phe He He Cys Gly Phe Asp Thr Met Asn Leu Asn Met 
260 265 270 

Ser Arg Leu Asp Val Tyr Leu Ser His Asn Pro Ala Gly Thr Ser Val 
275 280 285 

Gin Asn Val Leu His Trp Ser Gin Ala Val Lys Ser Gly Lys Phe Gin 
290 295 300 
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Ala Phe Asp Trp Gly Ser Pro Val Gin Asn Met Met His Tyr His Gin 
305 310 315 320 

Ser Met Pro Pro Tyr Tyr Asn Leu Thr Asp Met His Val Pro lie Ala 
325 330 335 

Val Trp Asn Gly Gly Asn Asp Leu Leu Ala Asp Pro His Asp Val Asp 
340 345 350 

Leu Leu Leu Ser Lys Leu Pro Asn Leu lie Tyr His Arg Lys lie Pro 
355 360 365 

Pro Tyr Asn His Leu Asp Phe lie Trp Ala Met Asp Ala Pro. Gin Ala 
370 375 380 

Val Tyr Asn Glu lie Val Ser Met Met Gly Thr Asp Asn Lys 
385 390 395 

<210> 36 
<211> 403 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 36 

Met Trp Arg Phe Ala Val Phe Leu Ala Ala Phe Phe Val Gin Asp Val 
15 10 15 

Val Gly Ser His Gly Asp Pro Glu Leu His Met Thr Thr Pro Gin lie 
20 25 30 

He Glu Arg Trp Gly Tyr Pro Ala Met He Tyr Thr Val Ala Thr Asp 
35 40 45 

Asp Gly Tyr He Leu Glu Met His Arg He Pro Phe Gly Lys Thr Asn 
50 55 60 

Val Thr Trp Pro Asn Gly Lys Arg Pro Val Val Phe Met Gin His Gly 
65 70 75 80 

Leu Leu Cys Ala Ser Ser Asp Trp Val Val Asn Leu Pro Asp Gin Ser 
85 90 95 

Ala Gly Phe Leu Phe Ala Asp Ala Gly Phe Asp Val Trp Leu Gly Asn 
100 105 110 

Met Arg Gly Asn Thr Tyr Ser Met Lys His Lys Asp Leu Lys Pro Ser 
115 120 125 

His Ser Ala Phe Trp Asp Trp Ser Trp Asp Glu Met Ala Thr Tyr Asp 
130 135 140 

Leu Asn Ala Met He Asn His Val Leu Glu Val Thr Gly Gin Asp Ser 
145 150 155 160 

Val Tyr Tyr Met Gly His Ser Gin Gly Thr Leu Thr Met Phe Ser His 
165 170 175 

Leu Ser Lys Asp Asp Gly Ser Phe Ala Lys Lys He Lys Lys Phe Phe 
180 185 190 
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Ala Leu 


Phe Phe 
210 


Ala Pro He Gly 
195 

Ala Asn Tyr Phe 


Ser Val Lys His He Lys Gly Phe Leu Ser 
200 205 

Ser Leu Glu Phe Asp Gly Trp Phe Asp He 
215 220 


Phe Gly 
225 


Ala Gly 
Ala Lys Asp He 


Glu Phe Leu Pro Asn Asn Trp Ala Met Lys Leu Ala 

230 235 240 

Cys Gly Gly Leu Lys Val Glu Ala Asp Leu Cys Asp 

245 250 255 


Asn Val Leu Phe Leu He Ala Gly Pro Glu Ser Asp Gin Trp Asn Gin 
260 265 270 


Thr Arg 


Gin Asn 
290 


Val Pro Val Tyr 
275 

He Val His Trp 


Ala Thr His Asp Pro Ala Gly Thr Ser Thr 

280 285 

Met Gin Met Val His His Gly Gly Val Pro 
295 300 


Ala Tyr Asp Trp Gly Thr Lys Thr Asn Lys Lys Lys Tyr Gly Gin Ala 
305 310 315 320 


Asn Pro Pro Glu 


Leu Tyr 
Asp Tyr 


Trp Ser 
340 

Leu Leu 
355 


His Leu 
370 


Pro Asp 
Asp He 
Leu Gly Lys 


Pro Asp 
385 


Tyr Asp Phe Thr Ala He Lys Gly Thr Lys He Tyr 
325 330 335 

Asp Ala Asp Trp Leu Ala Asp Thr Pro Asp Val Pro 
345 350 

Thr Arg Leu Asn Pro Ala He Val Ala Gin Asn Asn 
360 365 

Tyr Asn His Leu Asp Phe Thr Trp Gly Leu Arg Ala 
375 380 

Tyr Arg Pro Ala He Lys Leu Cys Thr Asp Asp Tyr 
390 395 400 
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